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Preface 


One of the landmarks in the history of mathematics is the proof of the nonexis- 
tence of algorithms based solely on radicals and elementary arithmetic operations 
(addition, subtraction, multiplication, and division) for solutions of general alge- 
braic equations of degrees higher than four. This proof by the French mathemati- 
cian Evariste Galois in the early nineteenth century used the then novel concept 
of the permutation symmetry of the roots of algebraic equations and led to the 
invention of group theory, an area of mathematics now nearly two centuries old 
that has had extensive applications in the physical sciences in recent decades. 

The radical-based algorithms for solutions of general algebraic equations of 
degrees 2 (quadratic equations), 3 (cubic equations), and 4 (quartic equations) have 
been well-known for a number of centuries. The quadratic equation algorithm uses 
a single square root, the cubic equation algorithm uses a square root inside a cube 
root, and the quartic equation algorithm combines the cubic and quadratic equation 
algorithms with no new features. The details of the formulas for these equations 
of degree d(d = 2,3, 4) relate to the properties of the corresponding symmetric 
groups Sq which are isomorphic to the symmetries of the equilateral triangle for 
d = 3 and the regular tetrahedron for d = 4. 

Related ideas can be used to generate an algorithm for solution of the general 
algebraic equation of degree 5 (the quintic equation). Such a quintic equation 
algorithm does not violate the classical theorem proved by Galois since it contains 
more complicated mathematical functions than the radicals that suffice for the 
algorithms for roots of the general quadratic, cubic, and quartic equations. 

The underlying mathematics for an algorithm to solve the quintic equation was 
developed by nineteenth century European mathematicians shortly after Galois’ 
discovery of the insolubility of the general quintic equation using only radicals. 
The initial work in this area was done by Hermite and then developed further by 
Gordan. This culminated in two key publications, an 1878 article by Kiepert! 
describing a quintic equation algorithm and the classic 1884 book by Klein? de- 
scribing the relationship between the icosahedron and the solution of the quintic 
equation. At that stage this work lay fallow for more than a century since the 
algorithm for roots of the general quintic equation appeared intractable before the 
era of computers. Many of the key ideas appear to have been forgotten by the 
subsequent generations of mathematicians during the past century so that some of 
the underlying mathematics has the status of a lost art. 

The work discussed in this book arose when I discovered the 1878 Kiepert 
paper and wished to see if the quintic equation algorithm described therein would 
work on a modern computer. I enlisted the help of a computer scientist, Prof. E. 
R. Canfield of the University of Georgia, to write a program that would use the 
Kiepert algorithm to solve the quintic equation (i.e., to calculate the roots of the 
quintic equation from its coefficients). This proved to be much more difficult than 
anticipated because of errors and lack of detail in the Kiepert paper. However, 
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the quintic equation algorithm was finally made to work on an IBM-compatible 
personal computer but only after studying a considerable amount of related math- 
ematics in several languages mainly in nineteenth century journals. This work led 
to the discovery of some unanticipated features of the quintic equation algorithm. 

This book presents for the first time a complete algorithm for the roots of 
the general quintic equation with enough background information to make the 
key ideas accessible to nonspecialists and even to mathematically oriented readers 
who are not professional mathematicians. Two relatively short papers**, have 
been published on this work on the quintic equation algorithm but the whole story 

is far too long to fit into a journal article of reasonable length. The book includes 

initial introductory chapters on group theory and symmetry, the Galois theory of 
equations, and some elementary properties of elliptic functions and associated 
theta functions in an attempt to make some of the key ideas accessible to less 
sophisticated readers. The book also includes discussion of the much simpler 
algorithms for roots of the general quadratic, cubic, and quartic equations before 
discussing the algorithm for the roots of the general quintic equation. The book 
concludes with a brief discussion of attempts to extend these ideas to algorithms 
for the roots of general equations of degrees higher than five. 

I am indebted to Prof. E. R. Canfield of the Computer Science Department at 
the University of Georgia for helpful discussions during the varous stages of this 
project. In addition, I would like to acknowledge the patience and cooperation of 
my wife, Jane, during the preparation of this book. 


R. Bruce King 
Athens, Georgia 
March, 1996 
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Chapter 1 
Introduction 


Consider an algebraic equation of degree n, i.e., 
“a 
f@) =aor" + ayx-1 4... ay bye =0 (1-1) 
, i=] 


Algorithms for solving such equations, i.e., determining their roots, X},...,%p, 
as functions of their coefficients, a,, have been of interest since ancient 
times.!,2 Thus the solution of quadratic equations, n = 2, has been known 
since Babylonian times and the “quadratic formula” 


_-ay + Vay2-—4 
ia ae (1-2) 


is familiar to high school students. A more complicated formula for solution of 
general cubic equations (equation 1-1 for n = 3), involving both square and 
cube roots, was found in the 16th century by Cardan, who built on previous 
work by Tartaglia and dal Ferro. Almost immediately thereafter Ferrari showed 
how the algorithms for solution of the quadratic and cubic equations could be 
combined to provide an algorithm for the solution of the general quartic 
equation (equation 1-1 for n = 4). Thus all equations of degree 4 and lower can 
be solved by using only square and cube roots. Solutions of algebraic 
equations (1-1) using functions no more complicated than radicals are called 


radical solutions or algebraic solutions of equations. 


The success in finding radical solutions for all equations of degree 4 or 
lower naturally stimulated a search for radical solutions of the general quintic 
equation (equation 1-1 for n = 5). Naively it might appear that fifth roots 


1M. Kline, Mathematical Thought from Ancient to Modern Times, 
Oxford Univ. Press, New York, 1972, pp. 263-272, 597-606, 752-763. 
23.-P. Tignol, Galois’ Theory of Algebraic Equations, Longman, Essex, 
England, 1988. 
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would be sufficient for solution of any quintic equation just as square and cube 
roots are sufficient for solution of all equations of degree 4 and lower. Indeed 
the special binomial quintic equation of the type 
x —as=0 (1-3) 

is soluble using fifth roots, i.e., x, = vas: just like any other binomial equation 
of degree m, i.e., x”* — am = 0, can be solved using the corresponding mth 
root, i.e., XE = Nam. However, efforts by mathematicians such as Leibniz, 
Tschirnhausen, Euler, Vandermonde, Lagrange, and Ruffini to find a radical 
solution of the general quintic equation (equation 1—1 for n = 5 with ag #0 for 0 
< k <5) all proved to be unsuccessful. Abel (1802-1829) then succeeded in 
proving the impossibility of solving by radicals the general equation of degree 
higher than four.3 Galois (1811-1832) subsequently developed a theory, still 
known as Galois theory, which provided a method for characterizing equations 
that are solvable by radicals.4 Galois’ methods required the development of 
group theory to study the effect of permutations of the roots of the equation on 
functions of the roots. Thus the solution of algebraic equations was the first 
application of group theory, an area of algebra that has subsequently found 
many other applications in physical and mathematical sciences including the 
symmetry of physical objects. For example, the solution of the general quartic 
equation (equation 1—1 for n = 4) can be related to the 24 ( = 4!) elements of 
symmetry in the tetrahedron (Figure 1—1a), and the solution of the general 
quintic equation (equation 1—1 for n = 5) can be related to the 120 (= 5!) 
symmetry elements in the icosahedron (Figure 1~1b). 


3N. H. Abel, Beweis der Unméglichkeit algebraische Gleichungen 
von héheren Graden als dem Viertem allgemein aufzulésen, J. fiir 
Math., 1, 65-84 (1826) = Euvres, 1, 66-94. 

48. Galois, Euvres Mathématiques, Gauthiers-Villars, Paris, 1897, pp. 
33-50. 
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Tetrahedron Icosahedron 


Figure 1-1: The tetrahedron and the icosahedron. 


The insufficiency of radicals for solution of the general quintic equation 
naturally raises the question as to what type of functions are needed for its 
solution. The simplest type of quintic equation not soluble by radicals has the 
so-called Bring-Jerrard form ; 

x ~agx+a5 =0 (1-4). 
The functions required to solve equations of this type as well as more 
complicated equations not solvable by radicals might be viewed as more 
complicated type of radicals, i.e., a hyperradicals, so that the development of 
the theory of such nonradical algebraic numbers might be viewed as an 


interesting branch of algebra. In this connection Hermite>-6 first showed that 


elliptic modular functions provided solutions to the general quintic equation but 
the methods still appeared rather intractable. These ideas were developed 
further by Gordan’ and Kiepert® in attempts to present complete algorithms for 
the solution of the general quintic equation. In a classic book Klein? presents in 


5C. Hermite, Sur la Résolution de 1’Equation du Cinquigme Degré, 
Compt. Rend., 46, 508-S15 (1858). 

6C. Hermite, Sur l’Equation du Cinquiéme Degré, in C. Hermite, 
Cuyvres, Volume Il, pp. 347-424, Gauthier-Villars, Paris, 1908. 


7p, Gordan, Uber die Auflésung der Gleichungen vom Fiinften Grade, 
Math. Ann., 13, 375-404 (1878). 


8L. Kiepert, Auflésung der Gleichungen Fiinften Grades, J. fiir Math., 
87, 114-133 (1878). 


9F, Klein, Vorlesungen iiber das Ikosaeder, Teubner, Leipzig, 1884. 
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detail the relationship between icosahedral symmetry and algebraic methods for 
solution of the general quintic equation. Although Klein’s book? is an 
impressive synthesis of diverse important areas of 19th century mathematics, it 
does not provide information on the properties of the elliptic modular functions 
necessary to solve the general quintic equation. 

The author has long been interested in applications of symmetry to 
chemical problems including icosahedral chemical structures. 10 This led to a 
natural curiosity about the relationship of icosahedral symmetry to the general 
quintic equation and then about the types of functions needed to solve the 
general quintic equation. However, none of the books on elliptic functions 
available in the extensive University of Georgia science library provided any 
information on the use of these functions to solve the general quintic equation. 
The classic paper by Kiepert® provides an algorithm on its last two pages for 
solution of the general quintic equation, but efforts to transfer this algorithm 
directly to a computer program were unsuccessful. Only after a detailed study of 
numerous 19th century mathematics papers in several languages as well as old 
algebra texts has it proven possible to assemble a working algorithm on a 
modern microcomputer for solution of the general quintic equation. This book 
is an effort to present the essential aspects of this classical and apparently 
largely forgotten mathematics in a modern form. 

Since the concepts of group theory and symmetry are so central to the 
solution of algebraic equations, these concepts are first discussed in Chapter 2 
with several geometric illustrations. Chapter 3 then extends the concept of 
symmetry to permutations of the roots of algebraic equations leading to the 
basic ideas of Galois theory. Other key concepts relating to solution of 
algebraic equations are also introduced in Chapter 3, notably the use of 
Tschirnhausen transformations to simplify the form of algebraic equations. 
Chapter 4 discusses the essential properties of elliptic functions and related 
integrals required for the solution of general algebraic equations of degree 5 or 
greater. . 


10R. B. King, The Icosahedron in Inorganic Chemistry, Inorg. Chim. 
Acta, 198-200, 841-861 (1992). 


Introduction 5 


The remainder of the book applies these ideas to the solution of 
algebraic equations. Chapter 5 presents the classical methods for solutions of 
general equations of degrees no greater than 4 where radical methods can be 
applied as well as methods for identification of special quintic equations also 
soluble by radicals. Aspects of these methods pertaining to the structure of the 
symmetric groups S, are given particular attention. Chapter 6 presents the 
details of the Kiepert algorithm® for solution of the general quintic equation 
using theta series. This algorithm has been verified on a microcomputer.!!.12 
Chapter 7 summarizes the earlier methods of Hermite5-© and Gordan? for 
solving the general quintic equation. The final chapter, Chapter 8, goes 
beyond the quintic equation to summarize what is known about methods for 
solution of the general sextic equation, certain special septic equations, and the 
Umemura formula based on higher order theta functions for solution of the 
general algebraic equation of any degree. 


11R,. B. King and E. R. Canfield, An Algorithm for Calculating the 
Roots of a General Quintic Equation from its Coefficients, J. Math. 
Phys., 32, 823-825 (1991). 

12R. B. King and E. R. Canfield, Icosahedral Symmetry and the Quintic 
Equation, Comput. and Math. with Appl., 24, 13-28 (1992). 


_ Chapter 2 


Group Theory and Symmetry 
2.1 The Concept of Groups 


A group is a collection or set of elements together with an inner binary 
operation, conventionally called multiplication, satisfying some special rules 
discussed below. It is not necessary to specify what the elements are in order to 
discuss the group that they constitute. Of particular interest in the context of 
this book are groups formed by permutations of sets of small numbers of 
objects, particularly the roots of algebraic equations; such groups are called 
permutation groups. Frequently the properties of such abstract permutation 
groups are most readily visualized by considering analogous or isomorphic 
groups. formed by sets of symmetry operations on polyhedra or other readily 
visualized spatial objects. Groups formed by symmetry operations on three- 
dimensional objects are called symmetry point groups. Some features of both 
of these types of groups will be discussed in this chapter. 

A set of elements combined with an binary operation (multiplication) to 
form a mathematical group must satisfy the following four conditions or rules!: 
(1) The product of any two elements in the group and the 
square of each element must be an element of the group. A product 
of two group elements, AB, is obtained by applying the binary operation 
(multiplication) to them and the square of a group element, A2, is the product of 
an element with itself. This definition can be extended to higher powers of 
group elements. The multiplication of two group elements is said to be 
commutative if the order of multiplication is immaterial, i.e., if AB = BA. In 
such a case A is said to commute with B. The multiplication of two group 
elements is not necessarily commutative. 

(2) One element in the group must commute with all others and 
leave them unchanged. This element is conventionally called the identity 


IF, J. Budden, The Fascination of Groups, Cambridge Univ. Press, — 
London, 1972. 
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element and often designated as E. This condition may be concisely stated as 
EX =XE =X. 

(3) The associative law of multiplication must hold. This 
condition may be expressed concisely as A(BC) = (AB)C, i.e., the result must 
be the same if C is multiplied by B to give BC followed by multiplication of 
BC by A to give A(BC) or if B is multiplied by A to give AB followed by 
multiplication of AB by C to give (AB)C. 

(4) Every element must have an inverse, which is also an 
element in the group. The element Z is the inverse of the element A if AZ 
= ZA =E. The inverse of an element A is frequently designated by A~!. Note 
that multiplication of an element by its inverse is always commutative. 

These defining characteristics of a group have been summarized 
concisely? by defining a group as “...a mathematical system consisting of 
elements with inverses which can be combined by some operation without 
going outside the system.” : 

The number of elements in a group is called the order of the group. 
This book will be concerned exclusively with finite groups, i.e., groups with 
finite numbers of elements. Within a given group it may be possible to select 
various smaller sets of elements, each set including the identity element E, 
which are themselves groups. Such smaller sets are called subgroups. A 
subgroup of a group G is thus defined as a subset H of the group G which is 
itself a group under the same multiplication operation of G. The fact that H is a 
subgroup of G may be written H CG. The order of a subgroup must be an 
integral factor of the order of the group. Thus if H is a subgroup of G and IHI 
and IGI are the orders of H and G, respectively, then the quotient IGI/IH| must be 
an integer. This quotient is called the index of the subgroup H in G. 

Let A and X be two elements in a group. Then X~!AX = B will be 
equal to some element in the group. The element B is called the similarity 
transform of A by X and A and B may be said to be conjugate. Conjugate 


elements have the following properties: 


2A. W. Bell and T. J. Fletcher, Symmetry Groups, Associated Teachers 
of Mathematics, 1964. 
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(1) Every element is conjugate with itself. Thus for any particular 
element A there must be at least one element X such that A = X~1AX, 

(2) IfA is conjugate with B, then B is conjugate with A. Thus if 
A =X~!BX, then there must be some element, Y, in the group such that B = 
Y-lay. 

(3) IfA is conjugate with B and C, then B and C are conjugate 
with each other. 

A complete set of elements of a group which are conjugate to one another is 
called a class (or more specifically a conjugacy class) of the group. The number 
of elements in a conjugacy class is called its order; the orders of all conjugacy 
classes must be integral factors of the order of the group. 

An important property of each element of a group is its period. In this 
context the period of an element is the minimum number of times that the 
element must be multiplied by itself before the identity element £ is obtained, 
i.e., the smallest positive integer n such that A" = E. The period of the identity 
element E is, of course, 1. The period of an element is sometimes also called its 
order but this is confusing because the term “order” is also used to describe the 
number of elements in a group or conjugacy class (see above). 

Certain elements 21, 22,...,2m of a given finite group G are called a set 
of generators if every element of G can be expressed as a finite product of their 
powers (including negative powers).3 A set of generators may be denoted by 
the symbol {21,22,...2m}. A set of relations satisfied by the generators of a 
group is called an abstract definition or presentation of the group if every 
relation satisfied by the generators is an algebraic consequence of these 
particular relations. A group with only one generator (i.e., m = 1) is a cyclic 
group, {g} = Cp, whose order n is the period of the single generator g, i.e., 
g" = E, where E is the identity element. The cyclic group C| is the trivial group 
consisting solely of the identity element. 

A group in which every element commutes with every other element is 
called a commutative group or an Abelian group after the famous Norwegian 
mathematician, Abel (1802-1829). In an Abelian group every element is in a 


3H. S. M. Coxeter and W. O. J. Moser, Generators and Relations for 
Discrete Groups, Springer-Verlag, Berlin, 1972. 
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conjugacy class by itself, i.e., all conjugacy classes are of order one. A normal 
subgroup N of G, written N AG, is a subgroup which consists only of entire 
conjugacy classes of G.4 A normal chain of a group G is a sequence of normal 
subgroups Cy INg, J Ng, Ng, J INg, AG, in which s is the number 


of normal subgroups (besides Cj and G) in the normal chain. A simple group 
has no normal subgroups other than the identity group Cy. Simple groups are 


particularly important in the theory of finite groups.>-© If a normal chain starts 
with the identity group Cy and leads to G and if all of the quotient groups 


NaylC1 = Cay Nag/Nay = Cay--1G! Ng, = C 


composite or soluble group. 


as4, are cyclic, then G is a 


2.2 Symmetry Groups 


Many of the properties of groups are most readily visualized by 
studying the groups consisting of symmetry operations of polyhedra or other 
concrete three-dimensional objects. In this context a symmetry operation isa 
movement of an object such that, after completion of the movement, every point 
of the body coincides with an equivalent point or the same point of the object in 
its original orientation. The position and orientation of an object before and 
after carrying out a symmetry operation are indistinguishable. Thus a symmetry 
operation takes an object into an equivalent configuration. 

The symmetry operations for objects in ordinary three-dimensional 
space can be classified into four fundamental types each of which is defined by 
a symmetry element around which the symmetry operation takes place. The 
four fundamental types of symmetry operations and their corresponding 
symmetry elements are listed in Table 2-1. 

The identity operation, designated as E, leaves the object unchanged. 
| Although this operation may seem trivial, it is mathematically necessary in order 


4J. K. G. Watson, On the Symmetry Groups of Non-Rigid Molecules, . 
Mol. Phys.,21, 577 (1971). 


5D. Gorenstein, Finite Groups, Harper and Row, New York, 1968. 


6D. Gorenstein, Finite Simple Groups: An Introduction to their 
Classification, Plenum, New York, 1982. 
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to convey the mathematical properties of a group on the set of all of the 
symmetry operations applicable to a given object. The reflection operation, 
designated as O, involves reflection of the object through a plane, known as a 
reflection plane. For example, in a reflection through the xy-plane 
(conveniently designated as O,y) the coordinates of a point (x, y, z) change to 
(x, y, —z}—~a reflection operation thus can result in the change of only a single 
coordinate. A rotation operation, designated as Cy, consists of a 360°/n rotation 
around a line, known as a rotation axis. For example, a C2 rotation around the 
z-axis changes the coordinates of a point (x, y, z) to (—x, —y, z)—a rotation 
operation thus can result in a change of only two coordinates. An improper 
rotation, designated as S,,, consists of a 360°/n rotation around a line followed 
by a reflection in a plane perpendicular to the rotation axis. An S2 operation is 
called an inversion and is designated by i; the intersection of the C2-axis and the 
perpendicular reflection plane is called an inversion center. Inversion through 
the origin changes the coordinates of a point (x, y, z) to (—x, —y, —z)-—thus an 
Sy, operation must change the signs of all three coordinates. An S} improper 
rotation in which the Cy proper rotation component is equivalent to the identity 
E corresponds to a reflection operation 6. Thus the reflection operation 6 is a 


special type of improper rotation, namely $4. 


Table 2—1: The Four Fundamental Types of Symmetry 


Operations 
Symmetry | Corresponding 
Operation Designation Symmetry Element Dimensions 
Identity (no change) E The entire object 3 
Reflection oO Reflection plane 2 
Rotation Ch Rotation axis 1 
Improper rotation Sn Improper rotation axis* 0 


*Point of intersection of a proper rotation axis and a perpendicular reflection 
plane 
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Consider the set of symmetry operations in ordinary three-dimensional 
space describing the symmetry of an actual object. Such a set of symmetry 
operations satisfies the properties of a group in the mathematical sense and is 
therefore called a symmetry point group. In most cases such a symmetry point 
group contains a finite number of operations and is therefore a finite group. 

There is a systematic way to classify objects in three-dimensional space 
by their symmetry point groups based on the following sequence of questions’: 
(1) Is the object linear (i.e., only “one-dimensional’’)? 

Linear objects are the only objects having infinite rather than finite 
symmetry point groups since the linear axis corresponds to an infinite order 


rotation axis, namely C.o.. If there is a reflection plane perpendicular to the » 


infinite order rotation axis (dividing the object into two equivalent halves), then 
the symmetry point group is D.., if not the symmetry point group is Co. 

(2) Does the object have multiple “higher-order” rotation axes 
(i.e., C32 axes)? 

Multiple axes in this question refer to noncollinear C2 axes. Such 
nonlinear objects have the symmetries of well-known regular polyhedra and are 
generally readily recognizable by containing the regular polyhedra in some 
manner. Such groups are sometimes called the polyhedral point groups. Thus 
the tetrahedral groups 7, Tp, and Tg with 12, 24, and 24 operations, 
respectively, have four noncollinear C3 axes and are distinguished by the 
presence or absence of horizontal (G,) or diagonal (Gy) reflection planes. The 
octahedral groups O and Oy with 24 and 48 operations, respectively, have not 
only four noncollinear C3 axes but also three noncollinear C4 axes and are 
distinguished by the presence or absence of reflection planes. Figure 2-1 
shows the multiple rotation axes of the regular octahedron and the cube, both of 
which have the octahedral point group Oj. The icosahedral groups / and J, with 
60 and 120 operations, respectively, have ten noncollinear C3 axes and six 
noncollinear C’s5 axes and are also distinguished by the presence or absence of 
reflection planes. 


7F, A. Cotton, Chemical Applications of Group Theory, Third Edition, 
Wiley, New York, 1990, Chapter 3. 
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38 If the object does not belong to either a linear group or a 
polyhedral group, then does it have proper or improper axes of 
rotation (i.e. Cy, or Sp)? 

If no axes of either type are found, the group is either Cs, Cj, or Cy with 
2, 2, and 1 operations, respectively, depending whether the object has a plane 
of symmetry (G), and inversion center (i), or neither. The C, designation 
corresponds to an object with no symmetry at all and therefore to a symmetry 
point group containing only one element, namely the identity element E. 


C3 
h Cy 
4 
ee oo Ae Cake ateteieleteled > Co 
Octahedron Cube 


Figure 2-1: The multiple rotation axes in the octahedron and cube, both of which have On 
symmetry. In the octahedron the C4 axes pass through opposite pairs of vertices, the C3 axes 
pass through the midpoints (+) of opposite pairs of faces, and the C2 axes pass through 
opposite pairs of edges. In the cube, the C3 axes pass through opposite pairs of vertices, the 
C4 axes pass through midpoints (*) of opposite pairs of faces, and the C2 axes pass through 
opposite pairs of edges. 
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(4) Does the object have an even-order improper rotation axis 
| S2n but no planes of symmetry or any proper rotation axis other 
than one collinear with the improper rotation axis? 
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The presence of an even-order improper rotation axis S2, without any 
noncollinear proper rotation axes or any reflection planes indicates the 
symmetry point group $2, with 2n operations. 

(5) If the object does not belong to the linear point groups, the 
polyhedral point groups, or the point groups Cs, Cy, C1, or San, 
then look for the highest order rotation axis. Call the highest order 
rotation axis Cy. 

(6) Are there n C2 axes lying in a plane perpendicular to the C, 
axis? If there are n C2 axes lying in a plane perpendicular to the Cn axis then 
the object belongs to one of the symmetry point groups Dy, Dpy, or Dag with 
2n, 4n, and 4n operations, respectively, depending on whether there are no 
planes of symmetry, a horizontal plane of symmetry (6y), or n vertical planes of 
symmetry (Oy), respectively. If there are no C2 axes lying in a plane perpen- 
dicular to the Cy, axis, then the object belongs to one of the symmetry point 
groups Cy, Cry, Cnr, with n, 2n, and 2n operations, respectively, depending 
upon whether there are no planes of symmetry, n vertical planes of symmetry 
(0, ), or a horizontal plane of symmetry (0,), respectively. If there are only C2 
axes, then a unique C2 axis is chosen as the “reference axis” if there is any 
ambiguity as to which C9 axis to choose. 


2.3. Regular Polyhedra 


The symmetries of regular polyhedra are important in the theory of the 
solution of algebraic equations. The symmetries of the five regular Platonic 
solids, namely the tetrahedron, octahedron, cube, icosahedron, and dodeca- 
hedron (Figures 2—1 and 2-2) were already recognized by the ancient Greeks. 
The fact that there are only five regular geometric solids in contrast to an infinite 
number of polygons must have been a major revelation to the ancient thinkers. 
The reason for only five regular geometric solids can be readily seen by the 
following arguments.8.9 


8H. S. M. Coxeter, Regular Polytopes, Pitman Publishing Corp., New 
York, 1948. 
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Tetrahedron Icosahedron Dodecahedron 


Figure 2-2: The regular tetrahedron, icosahedron, and dodecahedron. The regular 
octahedron and cube are shown in Figure 2-1. 
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In order to have the solid angles required for forming closed polyhedra, 
a minimum of three edges must meet at a vertex. In this connection the number 
of edges meeting at a vertex is called the degree of the vertex. In the case of 
completely regular polyhedra the configuration of edges meeting at each vertex 
must be identical. If three equal-length edges meet at every vertex and if they 
join each other in equilateral triangular faces, the result is a tetrahedron. If 
instead, four or five equilateral triangles meet at each vertex, the result is an 
octahedron or an icosahedron, respectively. All of these regular polyhedra with 
equilateral triangular faces may be considered to have their vertices on finite 
surfaces with positive curvature such as the sphere. If six equilateral triangles 
meet at a vertex, an infinite flat two-dimensional lattice is obtained with a 
familiar hexagon structure with its vertices on an infinite surface with zero 
curvature. Such planar structures are called ressellations (Figure 2-3). More 
than six equilateral triangles meet at a vertex only with puckering and thus 
cannot give a regular solid. The vertices of such configurations can only be 
placed on infinite surfaces with negative curvature. Thus there are only three 
regular polyhedra with (equilateral) triangular faces, namely the tetrahedron, 


90. T. Benfey and L. Fikes, The Chemical Prehistory of the 
Tetrahedron, Octahedron, Icosahedron, and Hexagon, Adv. Chem. 
Ser., 61, 111-128 (1966). 
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octahedron, and icosahedron. Such polyhedra with triangular faces are called 
deltahedra since their faces are shaped like the Greek letter delta, A. 


ee 


Hexagonal tessellation Tessellation of squares 
of triangles (“checkerboard plane”) 


Tessellation of hexagons 


Figure 2-3: Examples of tessellations on a flat plane. 


Additional regular solids can be formed from square or regular 
pentagonal faces. Vertices where three squares meet generate a cube whereas 
vertices where four squares meet generate a checkerboard plane (Figure 2-3). 
Vertices where three regular pentagons meet generate a regular dodecahedron; 
regular pentagons cannot form a tessellation. Regular hexagons with 120° 
angles can only form a tessellation and regular polygons with more than six 
sides have angles larger than 120° and thus cannot meet with two other equi- 
valent polygons to form a solid angle at all. 

The properties of the regular polyhedra are summarized in Table 2-2. 
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The regular polyhedra provide examples of pairs of dual polyhedra. 10 
In this connection a given polyhedron P can be converted into its dual P* by 
locating the centers of the faces of P* at the vertices of P and the vertices of P* 
above the centers of the faces of P. Two vertices in the dual P* are connected 
by an edge when the corresponding faces in P share an edge. The process of 
dualization has the following properties: 
(1) The numbers of vertices and edges in a pair of dual polyhedra P and P* 
satisfy the relationships v* =f, e* = e, f* = v. 
(2) Dual polyhedra have the same symmetry elements and thus belong to the 
same symmetry point group. 
(3) Dualization of the dual of a polyhedron leads to the original polyhedron. 
(4) The degree of a vertex in a polyhedron corresponds to the number of 
edges in the corresponding face in its dual. 
Examples of pairs of dual polyhedra among the regular polyhedra are the 
octahedron/cube dual pair with Oy, symmetry (Figure 2—4) and the 
icosahedron/dodecahedron dual pair with J, symmetry. The tetrahedron with Tg 
symmetry is self-dual (i.e., dual to itself). For example, a degree 3 vertex of a 
polyhedron corresponds to a triangular face in its dual. 


Table 2-2: Properties of the Regular Polyhedra 


Face Vertex Numberof Numberof Number of 
Polyhedron Type Degrees Edges Faces Vertices 
Tetrahedron Triangle 3 6 4 4 
Octahedron __ Triangle 4 12 8 6 
Cube Square 3 12 6 8 
Dodecahedron Pentagon 3 30 12 20 
Icosahedron _ Triangle ‘5 30 20 12 


108, Griinbaum, Convex Polytopes, Interscience, London, 1967, pp. 46- 
48. 
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dualization 
P P* 
Octahedron Cube 
v=6, e= 12, f=8 v=8,e=12,f=6 


Figure 2—4: The process of dualization to convert an octahedron into a cube. 


The regular icosahedron and its dual, the regular dodecahedron, have 
the J, symmetry point group with 120 elements. Its pure rotation subgroup, /, 
with 60 elements is isomorphic with the alternating group A5. The symmetry 
point group of the regular tetrahedron, Ty, is a subgroup of J, of index 5 but not 
a normal subgroup. Nevertheless, the ability to partition an object of icosa- 
hedral symmetry into five equivalent objects of tetrahedral or octahedral 
symmetry (O, ~ Tq X C2) is an essential part of the Kiepert algorithm for 
solution of the general quintic equation (Chapter 6). 

There are several ways of visualizing the partition of an object of icosa- 
hedral symmetry into five equivalent objects of at least tetrahedral symmetry 
thereby separating the effect of the fivefold axis of the icosahedron from that of 
its twofold and threefold symmetry elements. The 20 vertices of the dual of the 
icosahedron, namely the regular dodecahedron, can be partitioned into five sets 
of four vertices, each corresponding to a regular tetrahedron. Using the same 
idea Klein!! partitions the 30 edges of an icosahedron into five sets of six 
edges each by the following method (Figure 2-5): 

(1) A straight line is drawn from the midpoint of each edge through the center 
of the icosahedron to the midpoint of the opposite edge; 

(2) The resulting 15 straight lines are divided into five sets of three mutually 
perpendicular straight lines. 


11F, Klein, Vorlesungen iiber das Ikosaeder, Teubner, Leipzig, 1884, 
Part I, Chapter I, § 8. 


18 Beyond the Quartic Equation 


Each of these five sets of three mutually perpendicular straight lines resembles a 
set of Cartesian coordinates and defines a regular octahedron. The construction 
is the dual of a construction depicted by Du Val!? in color in which a regular 
dodecahedron is partitioned into five equivalent cubes. Alternatively, the 30 
edge midpoints of an icosahedron can be used to form an icosidodecahedron 
having 30 vertices, 60 edges, 32 faces, and retaining icosahedral symmetry. 
The 30 vertices of this icosidodecahedron can then be partitioned into five sets 
of six vertices, each corresponding to a regular octahedron (Figure 2-6). 


Figure 2~5: A regular icosahedron with its 30 edges partitioned into five sets of six edges 
each so that the midpoints of the edges in each set form a regular octahedron. The five sets of 
six edges each are indicated by the following types of lines: ———-saummm nnn nari - - ~~ 


Figure 2-6: The icosidodecahedron formed from the 30 edge-midpoints of a regular 
icosahedron and the partitioning of its 30 vertices into five sets of six vertices, each 
corresponding to regular octahedra. The six vertices in one of these sets are indicated by 
spades (4). 


12p. Du Val, Homographies, Quaternions, and Rotations, pp. 27-30 and 
Figs. 12-15, Oxford University Press, London, 1964. 
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2.4 Permutation Groups 


The previous sections apply the concepts of group theory to symmetry 
point groups such as those describing the symmetry of readily visualized 
polyhedra. The concepts of group theory, of course, can be applied to more 
abstract sets such as the permutations of a set X of n objects, which, for 
example, may be the n roots of an algebraic equation of degree n. A set of 
permutations of n objects (including the identity permutation) with the structure 
of a group is called a permutation group of degree n.'3 Let G be a permutation 
group acting on the set X . Let g be any operation in G and x be any object in 
set X. The subset of X obtained by the action of all operations in G on x is 
called the orbit of x. The operations in G leaving x fixed is called the stabilizer 
of x; it is a subgroup of G and may be abbreviated as Gy. A transitive 
permutation group has only one orbit containing all objects of the set X. Sites 
permuted by a transitive permutation group are thus equivalent. Transitive 
permutation groups represent permutation groups of the highest symmetry and 
thus play a special role in permutation group theory. 

The maximum number of distinct permutations of n objects is n!. The 
corresponding group is called the symmetric group of degree n and is 
traditionally designated as S,, (not to be confused with the designation S, for an 
improper rotation of order n in Section 2.2). The symmetric group Sy, is 
obviously the highest symmetry permutation group of degree n. All permuta- 
tion groups of degree n must be a subgroup of the corresponding symmetric 
group Sy. 

Let us now consider the structure of permutation groups. In this con- 
nection a permutation P;, of n objects can be described by a 2 x n matrix of the 
following general type where the top row represents site labels and the bottom 
row represents object labels: 


ig a) 


Pa=\ pi po p3 .-- Pn 


(2.4-1) 


13N. L. Biggs, Finite Groups of Automorphisms, Cambridge University 
Press, Leadon, 1971. 
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The numbers pj, p2, P3,-.-. Pn can be taken to run through the integers 1, 2, 
3,..., in some sequence. For a given n there are n! possible different Py 
matrices. The matrix P,9 in which the bottom row Pj, p2, P3,---, Pn has the 
integers in the natural order 1, 2, 3,...,.2 (ie., the bottom row of P,® is 
identical to the top row) can be taken to represent a reference configuration 
corresponding to the identity element in the corresponding permutation group. 

Permutations can be classified as odd or even permutations based on 
how many pairs of numbers in the bottom row of the matrix P, are out of their 
natural order. Alternatively, if the interchange of a single pair of numbers is 
called a transposition, the parity of a permutation corresponds to the parity of 
the number of transpositions. Thus a permutation which is obtained by an odd 
number of transpositions from the reference configuration is called an odd 
permutation and a permutation which is obtained by an even number of trans- 
positions from the reference configuration is called an even permutation. The 
identity permutation corresponding to the reference configuration has zero 
transpositions and is therefore an even permutation by this definition. 

A group can be defined relating the P, matrices for a given n. First 
redefine the rows of Py, so that the top row represents the reference 
configuration P,9 and the bottom row represents the object labels in any of the 
n! possible permutations of the n objects. These permutations form a group of 
order n! with the permutation leaving the reference configuration unchanged 
(i.e., that represented by P,9 as so redefined) corresponding to the identity 
element, E. This permutation group is the symmetric group, S,, of order n! as 
defined above. 

Now consider the nature of the operations in a symmetric permutation 
group Sy. These operations are permutations of labels which can be written as 
a product of cycles which operate on mutually exclusive sets of labels, e.g. 


( 4 3 i 3 gs (1 2 4)G 5)(6) (2.42) 


The cycle structure of a given permutation in the symmetric group 5, can be 
represented by a sequence of indexed variables, i. €., x1%2%3 for the 
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permutation in equation 2.4-2. A characteristic feature of the symmetric 
permutation group Sy for all n is that all permutations having the same cycle 
structure come from the same conjugacy class.14 Furthermore, no two 
permutations with different cycle structures can belong to the same conjugacy 
class. Therefore, for the symmetric permutation group S, (but not necessarily 
for any of its subgroups) the cycle structures of permutations are sufficient to 
define their conjugacy classes. Furthermore, the number of conjugacy Classes 
of the symmetric group S, corresponds to the number of different partitions of 
n where a partition of n is defined as a set of positive integers i 1,/2,.. 1k whose 
sum is n ene 2.43). 


Pi =n — (2.4-3) 


An alternative presentation of conjugacy class information for the 
symmetric groups S, is given by their cycle indices.15.16,17 A cycle index 
Z(S,) for a symmetric permutation group S,, is a polynomial of the following 


form: 
I=c 
ZS) = Yi ajxyCilxg“i2. . .xyCin (2.4-4) 


{= 


In equation 2.44 c = number of conjugacy classes (.€., partitions of n by 
equation 2.43), aj = number of operations of S, in conjugacy class i, xj = 
dummy variable referring to cycles of length j, and cjj = exponent indicating the 
number of cycles of length j in class i. These parameters in the cycle indices of 
the symmetric groups S, must satisfy the following relationships: 

(1) Each of the n! permutations of S, must be in some class, 1.e., 


14c Dp. H. Chisholm, Group Theoretical Techniques in Quantum 
Chemistry, Academic Press, New York, 1976, Chapter 6. 

15G. Pélya, Kombinatorische Anzahlbestimmungen fir Gruppen, 
Graphen, und Chemische Verbindungen, Acta Math., 68, 145 (1937). 
16G. Pélya and R. C. Reed, Combinatorial Enumeration of Groups, 
Graphs, and Chemical Compounds, Springer Verlag, New York, 1987. 
17N. G. Debruin in Applied Combinatorial Mathematics, E. F. 
Beckenbach, Ed., Wiley, New York, 1964, Chapter 5. 
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= 
Eai= nt (2.4-5) 
i= 


(2) Each of the n objects being permuted must be in some cycle of each 
permutation in S, (counting, of course, fixed points of cycles of length 1 
represented by x1°1), ie., 


J=n 


2 Ji =nforlsisc (2.46) 
j= 


The relationship between symmetry point groups and permutation 
groups as well as some important concepts can be illustrated by the symmetry 
point groups of the tetrahedron and trigonal bipyramid and the corresponding 
permutation groups on their vertices (Figure 2.7). The permutation group of 
the tetrahedron vertices is a transitive group since all four vertices are equi- 
valent. However, the permutation group of the trigonal bipyramid vertices is an — 
intransitive group partitioning its five vertices into two orbits, namely the two 
axial vertices (A in Figure 2.7) and the three equatorial vertices (E in Figure 


2:1): 


C3 
A 
A 

E7 : 

Bare | Aisa 
E MS E----m Cp 
™A 
Teirahedron Trigonal Bipyramid 


Figure 2.7: The tetrahedron and the trigonal bipyramid showing the proper and improper 
rotation axes (Note that S4 refers to an improper rotation axis of order 4 rather than the 
symmetric permutation group on four objects). The axial (A) and equatorial vertices (E) of the 
trigonal bipyramid are also shown. : 
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Let us now consider some of the properties of the symmetric 
permutation groups, S,. Any symmetric permutation group S, has a normal 
subgroup of index 2 (and thus of order n!/2) consisting of only the 
permutations of even parity (which necessarily includes the identity 
permutation). This special subgroup of S,, is called the alternating group and is 
designated as Aj. The symmetric permutation group S4 is isomorphic with the 
full tetrahedron symmetry point group of order 4! = 24, Tg, (Figure 2.7) 
whereas the alternating permutation group Aq is isomorphic with the tetrahedral 
proper rotation subgroup T of order 4!/2 = 12. The corresponding chain of 
normal subgroups for Sq is Cy <1 C2 <1 D2 Aq =T <S4 =Tq with the 
quotient groups C2/Cy = C2; D2/C2 = C2; A4/D2 = C3; S4/Ag = C2. 

The problem of solving the general quintic equation relates to the fact 
that the alternating group As is a simple group. This can be proved by showing 
that As has no normal subgroups and that As is the only normal subgroup of 
Ss. The proof of this fact can be related to the symmetries of the icosahedron 
(e.g., Figure 2.5). Thus the 60 permutations of As can be partitioned into the 
following classes: 

(1) The identity; 

(2) 24 cycles of period 5 such as (12345), each of these corresponding toa 
rotation through 2nk/5 (k = 1, 2, 3, 4) around a diameter passing through 
opposite vertices of the icosahedron; 

(3)  20cycles of period 3 such as (123), each of these corresponding to a 
rotation through +2n/3 around a diameter passing through midpoints of 
opposite faces of the icosahedron; 

(4) 15 double transpositions of period 2 such as (12)(34) corresponding to 
half turns around a diameter passing through midpoints of opposite edges of the 
icosahedron. 

A normal subgroup of a group G consists of entire conjugacy classes of 
G. For this reason the order of a normal subgroup of A5 must be of the form 1 
_ + 24ns + 20n3 + 15n2 where ns, n3, and n2 are each 0 or 1 (but not all 0 nor all 
1). But this number must be a factor of 60 and thus must be 30, 20, or 15, 
which is clearly impossible. Thus the alternating group A5 cannot have any 
normal subgroups. 
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A similar argument can be used to show that As is the only normal 
subgroup of S5. Furthermore all Ay (n 2 6) can also be shown to be simple 
groups. 

The transitive groups of low degrees are particularly significant in the 
theory of permutation groups and the theory of solution of algebraic equations. 
The transitive groups of degrees up to eleven have been tabulated and their 
properties are given in detail.!8 All of the transitive permutation groups of 
degree $7 are listed in Table 2-3. 

The following points about Table 2—3 are of interest: 

(1) In the groups Cy, the order is the same as the number of classes, 
therefore these groups are Abelian. 

(2) The groups C3, C4,A4=7,C5,A5= J, and Ag contain only even 
operations. 

(3) The groups Ms and M7 are metacyclic groups of degrees 5 and 7, 
respectively. For prime n, metacyclic groups of order n(n + 1) are the largest 
permutation groups of degree n which are soluble groups. A metacyclic group 
is defined in terms of two generators s and ¢ and the relationships sP = P-1=E 
and t-lst =s" where p is a prime (5 in the case of M5) and r is a primitive root 
(mod p) which is 2 when p = 5.3 

(4) The dihedral group D3 can be a transitive permutation group of either 
degree 3 or 6. Similarly, the tetrahedral rotation group T of order 24 can bea 
transitive permutation group of either degree 4 (e.g., the faces of a tetrahedron) 
where it is the alternating group Aq or degree 6 (e.g., the edges of a 
tetrahedron). In addition, the icosahedral rotation group J of order 60 can be a 
transitive permutation group of either degree 5 where it is the alternating group 
As (see Section 2.3 and Figures 2-5 and 2-6 for more details) or degree 6 
permuting the six diameters of the icosahedron connecting pairs of antipodal 


vertices. 


18G. Butler and J. McKay, The Transitive Groups of Degree up to 
Eleven, Communications in Algebra, 11, 863-911 (1983). 
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Table 2—3: Transitive Permutation Groups of Degrees Less than 


Eight 
Group Degree Order Number of Classes _ Special Properties 
C3 3 3 3 Even, Abelian 
D32 83 3 6 3 Dihedral, Symmetric 
C4 4 4 4 Abelian 
D2 4 4 4 Even, Abelian 
Dog 4 8 3 
Ag=T 4 12 4 Even 
S4=Tq 4 24 > Symmetric 
C5 5 5 5 Even, Abelian 
Ds 5 10 4 Dihedral 
Ms 5 20 5 Metacyclic 
As=!1 5 60 5 Even, Simple 
S5 5 120 7 Symmetric,Not Ip! 
C6 6 6 6 Abelian 
Ds 6 6 3 
Deg 6 12 3 Dihedral 
Ag=T. 6 12 4 Even 
6 18 9 
6 24 8 
6 24 5 Even 
6 24 5 Isomorphic to Tg 
6 36 9 
6 36 6 
6 48 10 
L(2,5) =1 6 60 5 
6 dz 9 
6 120 7 Isomorphic to S5 
A6 6 360 7 Even, Simple 
S6 6 720 11 Symmetric 
C7 7 7 7 Even, Abelian 
D7 7 14 5 Dihedral 
7 21 5 Even 
M7 7 42 7 Metacyclic 
L@,2) 7 168 6 Even, Simple 
Aq 7 2520 9 Even, Simple 


~] 


S7 5040 15 Symmetric 
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2.5 Polyhedral Polynomials 


The usual type of convex polyhedra, including the regular polyhedra 
discussed in Section 2.3, can be represented as points on the surface of a 
sphere. The same sphere, taken as a special unit sphere called the Riemann 
sphere, can also be used to represent the complex numbers z = a + bi thereby 
providing a method for generating special polynomials associated with a given 
polyhedron. First represent the complex numbers z =a + bi by points on the 
(x,y) plane with the x-coordinate corresponding to a and the y-coordinate 
corresponding to b; such a plane is called an Argand plane (Figure 2—8a). The 
Argand plane can correspond to the equatorial plane of the Riemann sphere, 
(Figure 2-8b) in which the north pole is e and the south pole is 0; the Q--o 
axis can be called the polar axis.19 The equation of the Riemann sphere is 
taken to be p2 + q2 +72 =1. 


a axis 
North pole (ce 
y (a,b) eee 
Equator 
x 
‘ South pole(0) 
Argand plane Riemann sphere 


Figure 2-8: (a) The Argand plane indicating the point for z= a + bi; (b) The Riemann 
sphere. 


The polyhedral polynomials are polynomials whose roots correspond to 
the locations of the polyhedral vertices, the midpoints of the polyhedral edges, 
or the midpoints of the polyhedral faces on the surface of the Riemann sphere. 


19F. Klein, Vorlesungen iiber das Ikosaeder, Teubner, Leipzig, 1884, 
Part I, Chapter II. 
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Polyhedral polynomials are frequently expressed in terms of homogeneous 
variables so that all terms are the same combined degree in two variables, i.e., 


n n 
Santi = Yi aniuivr-i (2.5-1) 
i=] i=1 


The use of homogeneous variables corresponds to the substitution x = u/v. The 
special symmetry of the regular polyhedra corresponds to the vanishing of some 
special functions of these polyhedral polynomials known as transvectants 
(Section 2.6). 

Now consider the projection of points on the Riemann sphere onto its 
equatorial plane, taken as an Argand plane as noted above (Figure 2-8). A 
complex number z = a + bi gives 


iat ae ei = . . _p+ig 
Q=T 7:58 = L, ath; (2.5-2) 
Solving for p, g, and r gives 
2a 2b _-l+a*+b? 


r (2.5—3) 


PO Tal abe! Peete be hee Be 


Every rotation of the Riemann sphere around its center corresponds to a linear 
substitution 


_ozt+B 
yz+6 


(2.5—4) 


For a rotation of the sphere by an angle 9 where p, q, r and ~p, ~—g, —r remain 


constant, 


V+ tu)z — (t — is) 


~ (t + is)z + (v —in) (2.5—5) 


Zz 


where s = p sin(@/2), t = q sin(®/2), u =r sin(®/2), and v = cos (0/2) so that 
set Ptyr+y2=1 (2.5-6) 


| 
4 
; 
| 
i 


Lcasinmreiatoemsnbitegs Cagis 'periervemnneine passione: Sota econ ne tn otdeescanneee nMET 6, 
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For a rotation about the polar axis this reduces to 

z' = el8z ' (2.5-7) 

Now consider the vertices of a regular octahedron and a regular 
icosahedron as points on the surface of such a Riemann sphere oriented such 
that the north pole (z = ©) is one of the vertices in each case. This leads to the 
following homogeneous polynomials for these regular polyhedra in these 
orientations where z is taken to be u/v.20 

(a) Octahedron (O, symmetry): 


Vertices: t = uv(u4 — v4) (2.5—8a) 

Edges: x = u!2 — 33u8v4 — 33u4v8 + yl2 (2.5-8b) 

Faces: W = u8 + 14u4v4 + v8 (2.5—-8c) 

(b) Icosahedron ({, symmetry): 

Vertices: f = uv(ul0 + 11u5y5 — y!0) (2.5~9a) 
Edges: T = 430 + 522u25y5 — 10,005 u20y!0 — 10,005 y10y20 

— 522 usy25 + y30 (2.5-9b) 

Faces: H = —u20 + 228 ul5y5 — 494 y10y10 — 228 ySy15 — y20 

(2.5~—9c) 


The roots of these polyhedral polynomials correspond to the locations of the 
vertices, edge midpoints, and face midpoints on the Riemann sphere. Their 
degrees are equal to the numbers of corresponding elements (vertices, edges, or 
faces). 

The following features are of interest concerning the polyhedral 
polynomials such as those in equations 2.5-8 and 2.5-9: . 
(1) The vertices of a regular polygon with n vertices in the equatorial plane 
with one vertex at z = 1 correspond to the n roots of unity. 
(2) Derivation of the vertex polynomial of the octahedron, t(u,v), 1S 
particularly obvious. Orientation of an octahedron so that two of the vertices 
are at the poles (z = 0,°°) and one vertex is at z = 1 (Figure 2~9) forces the other 
three vertices to be at z=—1, i, and ~i so that 

c=22—-1(2+1z-DZtd) =2e4-DY= uv(u4 — v4) (2.5—10) 


after converting to homogeneous variables by z = u/v. 


20L. E. Dickson, Modern Algebraic Theories, Sanborn, Chicago, 1930, 
Chapter 13. 


Group Theory and Symmetry 29 


Figure 2~9: Orientation of the regular octahedron inside the Riemann sphere. 


(3) The vertex and face polynomials of a polyhedron correspond to the face 
and vertex polynomials, respectively, of its dual. Thus the vertex polynomial t 
(equation 2.5—8a) and face polynomial W (equation 2.5~8c) of the octahedron 
corresponds to the face polynomial and vertex polynomial, respectively, of its 
dual, namely the cube. 

(4) A cube can be composed of two mutually dual tetrahedra, whose edges 
correspond to face diagonals of the cube (Figure 2-10). A single tetrahedron in 
this orientation has the following polynomial functions: 


Vertices: ® = 14 + 2V-3 u2v2 + v4 (2.5—11a) 
Edges: t= uv(u4 — v4) (2.5—11b) 
Faces: Y = u4 — 2V-3 u2v2 + v4 (2.5-11c) 


Note that the tetrahedron edge polynomial (equation 2.5—11b) is the same as the 
octahedron vertex polynomial (equation 2.5—8a) since the midpoints of the six 
edges of the tetrahedron in this orientation are in the same locations (z = 0, °, 
+1, +i) as the six vertices of the standard octahedron. Furthermore, the face 
polynomial Y of the tetrahedron is the same as the vertex polynomial of its 
dual. Since the four vertices of the tetrahedron and the four vertices of its dual 
together make the eight vertices of a cube, the product of the vertex functions of 
the two tetrahedra equals the vertex function of the cube made by the two 
_ tetrahedra OY = W. 
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Figure 2-10: Decomposition of a cube into two tetrahedra, whose vertices are indicated by 
A and B. The edges of the A tetrahedron are shown to the extent that they are visible. 


(5) The special symmetries of the regular deltahedra lead to the following 


identities: 
Tetrahedron: 12V—3 t? - 03 + ¥3=0 (degree 12) (2.5—12a) 
Octahedron: 108 14 — W3 + y2=0 (degree 24) (2.5—12b) 
Icosahedron: 1728 f ~ H3 —T2=0 (degree 60) (2.5-12c) 


Note that the degrees of the left side of the identities (2.5-12) in (u,v) 
correspond to the orders of the pure rotation groups of the corresponding 
deltahedra, which are isomorphic to the permutation groups relevant to the 
solution of algebraic equations, i.e., O ~ 54 for the quartic equation and I = A5 
for solution of the quintic equation. In this connection the icosahedral identity 
(equation 2.5—12c) will be seen to be important for the solution of the quintic 
equation. 


2.6 Transvectants of Polyhedral Polynomials 


The polyhedral polynomials in homogeneous form can be related by 
their transvectants derived from their partial derivatives by means of invariant 
theory.21.22 In this connection the nth transvectant of two homogeneous 
polynomials f(x,y) and g(x,y), designated as (f,g)" is defined by the equation 


213. H. Grace and A. Young, The Algebra of Invariants, Cambridge, 
1903. 

220. E. Glenn, A Treatise on the Theory of Invariants, Ginn and Co., 
Boston, 1915. 
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avr (, dg (x,y) 
(fg) Se staan Gren aelayr-t) (2.6—-1) 


Of particular interest are the transvectants of a polynomial with itself, i.e., 
Gy)". 

Odd transvectants of the type (f/)" (n = 1, 3, 5, 7,...) vanish since they 
have even numbers of terms of alternating sign which cancel out completely 
corresponding to the (—1)* factor in equation (2.6—-1). The first transvectant 
(f,g)! is also known as the functional determinant of f and g, since it can be 
expressed by the determinant 


ae 
(fg)! = (2.6—2). 


St 3 


The second transvectants (f,g)* are also known as Hessians and can also be 
expressed by the determinant 


af fF 
ax2 oxdy 
(fg)? = (2.6-3). 
og og 
dydx dy2 


Let v, e, and f, be the number of vertices, edges, and faces of regular 
deltahedra, namely the tetrahedron, octahedron, and icosahedron. The 
functional determinants and Hessians relate the vertex (u,v) of degree v), 
edge (E(u,v) of degree e), and face (F(u,v) of degree f) functions by the 
following relationships where ky y and ky ¢ are integers: 
(VV? =kyy F (2.6-4a) 
(VAL =k yt | (2.6-4b) 
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The fact that each differentiation step lowers the degree of the polynomial by 1 
can be used to relate equation (2.6-4b) to Euler’s theorem? as follows: 
deg(V,F)! = (v -— 1) + - 1) =deg(H =e 


= v +f—2 =e corresponding to Euler’s theorem. (2.6-5) 
Similarly from equation (2.6-4a) 

deg (V,V)2 = 2(v -2) = 2v-4=f (2.6-6). 
For a deltahedron in which all faces are triangles 

3f=2e = e=3/of (2.6~7) 
so that Euler’s theorem becomes 

v+f-2=3)f>2w-4=f (2.6-8) 


Note the resemblance of equations (2.66) and (2.6-8). 

A special feature of the vertex polynomials for regular deltahedra, 
namely t of equation (2.5—8a) for the octahedron, f of equation (2.5—9a) for the 
icosahedron, and ® of equation (2.5—11a) for the tetrahedron is the identical 
vanishing of their fourth transvectants, i.e., (t,1)4 = 0, (£4 = 0, and (®,)4 = 
0, respectively. This is a special indication of the symmetry of the regular 
deltahedra. 

Figure 2—11 illustrates the procedure for calculating the second and 
fourth transvectants (t,1)* and (t,t)* of the octahedral vertex function Tt = 
uv(u4—y4). Note that the Hessian (t,t)2 is a multiple of the corresponding face 
function, namely —25W whereas the fourth transvectant (t,t)* vanishes 


identically. 


23B. Griinbaum, Convex Polytopes, Interscience Publishers, New 
York, New York, 1967, pp 130-142. 
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Figure 2~11: Illustration of the calculation of the second transvectant (t,t) and the fourth 
transvectant (1,7)4 of the octahedral vertex function t= uv(u4—v4). Subscripts indicate 
differentiation variables and d, and 0, indicate differentiation with respect to u and v, 
respectively. 
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Chapter 3 


The Symmetry of Equations: Galois Theory 
and Tschirnhausen Transformations 


3.1 Rings, Fields, and Polynomials 


The previous chapter uses group theory to study spatial and permuta- 
tional symmetry. This chapter extends the application of group theory to the 
symmetry inherent in algebraic equations, which is closely linked to the 
methods required for their solution. The group-theoretical aspects of algebraic 
equations were first introduced by Evariste Galois (1811-1832) so that this area 
of mathematics is frequently called Galois theory. An excellent discussion of 
Galois theory is given in a book by Stewart.! 

The concepts of a ring, an integral domain, and a field are essential to 
Galois theory. Such concepts express in a rigorous way the properties of 
numbers of various types and allow a precise definition of the concept of a 
polynomial corresponding to the left side of the general algebraic equation 


n 
f(x) =agx" + apr! +... dn = Yianit! =0 (3.1-1). 
| i=l 


These concepts are defined as follows: 

(1) Ring: A ring is defined as a set R equipped with two binary operations, 
namely addition (+) and multiplication (x), such that R is an Abelian group 
under addition, multiplication is associative, and the following distributive laws 
hold for alla, be R: 


(a+b)xc=(axc)+(bxc) (3.1-2a) 

ax(bt+c)=(axb)+(axc) (3.1—2b) 
The operation of addition has an additive identity called 0 such that 

a+O=a (3.1-3). 


The operation of multiplication a x b may be abbreviated as ab. Note that a ring 
has two binary operations whereas a group (Section 2.1) has only a single 
binary operation. 


1], Stewart, Galois Theory, Chapman and Hall, London, 1973. 
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(2) Integral domain: An integral domain is a ring D with the following 
three additional properties: 
(a) Multiplication in D is commutative, i.e., ab = ba for all a, b. 
(b) There exists an element 1 € D such that al = Ja=a forallae D; 
(c) Ifab=0 fora, bé D then either a =0 or b=0. 
(3) Field: A field is a ring F such that F\{0} (ie., the field without the 
additive identity) is an abelian group under multiplication. Thus every non-zero 
element a € F has a multiplicative inverse a-! where ab-! may also be written as 


bo a 
division, i.e., a/b or Be 


Frequently encountered examples of such structures include the integral 
domain Z of integers and the fields Q, R, and C of rational, real, and complex 
numbers, respectively. 

The concepts of subring, subfield, and ideal are also useful. A subring 
of a ring R is a nonempty subset S such that if a,b € S, thena+be S,a-—b 
e S,andabe S. A subfield of a field F is a subset S containing the elements 0 
and 1 such that if a,b € S, thena+be S,a—be S,abe S, and further if a 
#0, then a-! e€ S. An ideal of aring R is a subring J such that ifie Jandre 
R then ir and ri are in J. Thus Z is a subring of Q, and R is a subfield of C 
while the set 2Z of even integers is an ideal of Z. If / is an ideal of the ring R, 
the quotient ring R// consists of the cosets of J in R with the following 
operations where r,s € R and/ +r isthe coset {i+r:ie J}: 

dV+rn+U+s)=1+(r+5) (3.1—4a) 

U+r)\7 +s) =1+ (rs) (3.1-4b) 
Thus the set of integers divisible by a fixed integer n, namely nZ, is an ideal of 
2 and the quotient ring Z, = Z/nZ is the ring of integers modulo n (or mod n). 
The elements of the ring Z, are conventionally written as 0, 1, 2,...,n—1. 

The following property of Z, is important: 

Theorem 3.1-1: The ring Z,, is a field if and only if 1 is a prime number. 

Proof: (a) Suppose n is not prime. If n = 1, then Z, = Z/Z which has only 
one element and thus cannot be a field. If n > 1, then n = rs where r and s are 
integers less than n. If/ =n Z, then (7 +r) +s) =/ + rs =I by equations 
3.1-4. But / is the zero element of Z// while / + r and J =5 are non-zero. 


Oe teh bone Lcee e. LEGE ENP bi man se Aenea btcandz nacre inssieilhmananante lapis weirs dnt: Dbieh diatitsteaaiin mba, acts iads taesmmamacefrmmmnces imeem teases th emma tanta tana Boma hp le aftpopnacicaats il ts inblthaifpine dist nitpeaoe:Ubdndddiitiaipaastttaen otto, sep rttieytag. uae 4 salsa ipo. e we PPE aC dict Mat Cals tail PME NAA Ts RiP OOF aE erate DDE a atte AAG FP nd elatiary ste nine Natta Dab beEDae 5 PORE al ial IAL ES ie. 
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Since in a field the product of two non-zero elements is non-zero, 2/I cannot be 
a field. 

(b) Suppose n is prime. Let J +r be a non-zero element of 2/I. Since r and n 
are coprime, there exist integers a and b such that ar + bs = 1. Then 
G+ad+n=C+1-W+tnd+b)=1+1landd+n¢+a)=/ +1. Since 
I + 1 is the identity element of Z//, we have found a multiplicative inverse for 
the given element J + r. Thus every non-zero element of 2// has an inverse, so 
that Z, = Z/I is a field. 

The prime subfield of a field K is defined to be the intersection of all 
subfields of K and is thus the unique smallest subfield of K. The rational 
numbers Q and the field Zp (p prime) have no proper subfields and are thus 
equal to their prime subfields. It can be shown that these are the only fields 
which can occur as prime subfields in the following theorem: 

Theorem 3.1-2: Every prime subfield is isomorphic either to the field Q of 
rational numbers or the field Zp of integers modulo a prime number. 

Proof: Let K be a field and P its prime subfield. P contains 0 and 1 and 
therefore contains the elements n* (n € Z) defined by 


n¥=1l+1+...4+1( times) if n > 0 (3.1—Sa) 
0* =0 . (3.1—5b) 
n* = —(—n)* ifn <0 (3.1-5c) 


Two distinct cases arise: 
(a) n* = 0 for some n #0. Since also (—n)* = 0, there exists a smallest positive 
integer p such that p* = 0. If p is composite, say p = rs where r and s are 
smaller positive integers, then r*s* = p* = 0 so that either r* = 0 or s* =0 
contrary to the definition of p. Therefore p is prime. The elements n* form a 
ring isomorphic to Zp, which is a field by Theorem 3.1-1. This must be the 
whole of P since P is the smallest subfield of K. 
(b) n* #0 if n #0. Then P must contain all of the elements m*/n* where m 
and n are integers and n # 0, These form a subfield isomorphic to Q by the 
map which sends m*/n* to m/n which is necessarily the whole of Pa 

A field K with a prime subfield isomorphic to Q is said to have 
characteristic 0. A field K with a prime subfield isomorphic to Z, is said to 
have characteristic p. 
This leads to the following lemmas!: 
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Lemma 3.1-3: If K is a subfield of L, then K and L have the same 
characteristic. 

Lemma 3.14: If k is a non-zero element of the field K and if n is an integer 
such that nk = 0, then n is a multiple of the characteristic of K. 

Sometimes it is possible to embed a ring R into a field, i.e., to find a 
field containing a subring isomorphic to R. For example, the integers Z can be 
embedded into the rational numbers Q in a way that every element of Q is a 
fraction whose numerator and denominator lie in Z. This concept is generalized 
by defining a field of fractions of the ring R as a field K containing a subring 
R" isomorphic to R such that every element of K can be expressed in the form 
r/s for r,s € R" where s #0. It can be shown that every integral domain 
possesses a field of fractions and that for any given integral domain R, all fields 
of fractions are isomorphic. An element r € R may be identified with its image 
[r,1] in the corresponding field of fractions F so that [r,s] is the fraction r/s in 
F, 

These concepts allow a precise logical definition to be given of a 
polynomial, which is the left side of the general algebraic equation 3.1—1. Let 
R be a ring in which multiplication is commutative. A polynomial over R in the 
indeterminate x can be defined by the following expression where r9,...,7%n € R, 
0 <ne Z, and x is undefined: 

TOF TIX +... +7 yx" (3.1-6) 
The elements ro,...rn are called the coefficients of the polynomial. If r, = 1, 
then the polynomial is called a monic polynomial. Terms of the type Ox” and 
1x” are frequently omitted and written as x”, respectively. Two polynomials 
are defined to be equal if and only if the corresponding coefficients of all terms 
are equal. The set of all polynomials over the ring FR in the indeterminate x is 
also a ring R[x] with the same operations as in R. The ring R[x] is called the 
ring of polynomials over R in the indeterminate x. If R is an integral domain, 
R[x] has a field of fractions as suggested by the following lemma: 

Lemma 3.1-5: If R is an integral domain and x is an indeterminate, then R[x] 
is an integral domain. : 

Proof: Suppose that f= fo +fix +... + fax" and g=go+gixt... + gmx™ 
where f, # 0 # gm and all of the coefficients f; and gz lie in R. The coefficient 
of x"*" in fg is fn2m Which is non-zero since R is an integral domain. Thus if f 
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and g are non-zero, then fg is non-zero indicating that R(x) is an integral 
domain.@ 

The field of fractions necessarily possessed by the integral domain R[] is 
called the field of rational expressions in x over R and can be denoted by K(x), 
where K is a field of fractions of R. 

If f is a polynomial over a commutative ring R and f # 0, then the 
degree of f, designated as of, is the highest power of the indeterminate x 
occurring in f with a non-zero coefficient. If R is an integral domain and f and 
g are polynomials over R, then the degrees of the polynomials have the 


following relationships: 
o(f + g) S maximum (Of,dg) (3.1—7a) 
O(fg) = of + 0g (3.1-7b) 


The < rather than = sign in equation 3.1—7a is necessary to allow for the special 
case of cancellation of the highest degree terms in fand g. 

A polynomial may always be divided by another polynomial provided 
that a remainder term is allowed leading to the following theorem: 

Theorem 3.1-6: If f and g are polynomials over a field K and f # 0, then 
there exist unique polynomials q and r such that g = fq + r where r has smaller 
degree than f. ; 
Proof: The proof uses induction on the degree of g. If dg = —e, then g = 0, 
and q and r may also be taken to be 0. If dg = 0, then g =ke K. If also f= © 
0, then f=j € K, and we may take q = k/j and r= 0. Otherwise of > 0 so that 
we may take g = 0 and r = g to start the induction. 

Now assume that the theorem holds for all polynomials of degree <n 
and let dg =n >0. If of > 0g we may as before take g = 0 andr =g. 
Otherwise we may take f= ayx'"+ ... + ag and g = byx" +... + bo where Qm, #4 
0#b, andm <n. Now start the division process g/f by putting 
21 = byQm1x"-f — g so that terms of the highest degree cancel, and dg} < Og. 
By induction we can find polynomials g and ry such that 21 =fgq1 + ry and or} 
< df. Now let g = Dydm-1x"-—™ — qi and r = —r) so that g = fg + rand or < of as 
required. : 

Finally we prove the uniqueness of g =fq +r. Suppose that g = 
fai +11 =fq2 + r2 where Or1,0r2 < of. Then f(qi — 92) =r1 — 12. However 
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(41 — G2) must have a higher degree than rj — 2 unless both are zero. Since f# 
0, 91 = q2 and rj =r2 = q andr are unique.@ 

Using this notation gq is called the quotient and r is called the remainder 
upon dividing g by f. The process of finding g and r is sometimes called the 
Division Algorithm or Euclidean Algorithm. 

The next step is to introduce the concepts of divisibility and highest 
common factor for polynomials. Thus if fand g are polynomials over a field 
K, then we say that f divides g (orf is a factor of g or g is a multiple of f) if 
there exists some polynomial h over K such that g = fh. The notation flg means 
that f divides g whereas the notation f4g means that f does not divide g. The 
polynomial d over K is called a highest common factor (hcf) of f and g if dif and 
dig, and further, whenever elf and elg, then eld. 

The following theorems relate to the properties of highest common 
factors: 

Theorem 3.1-7: If d is a hcf of the polynomials f and g over a field K and if 
0#ke K, then kd is also a hef for fandg. If d and e are two hcfs for f and g, 
then there exists a non-zero element k € K such that e = kd. 

Proof: Clearly kdlf and kdlg. If elf and elg, then eld so that elkd. Therefore kd 
is a hef. If d and e are hcfs, then by definition eld and dle so that e = hd for 
some polynomial h. Since eld the de < dd so thatdh sO > h=keK. Since 0 
#e=kd, thenk +0. 

Theorem 3.1-8: If fand g are non-zero polynomials over a field K and d is 
a hef for f and g, then there exist polynomials a and b over K such that d= af + 
bg. 

Proof: Let f=r_; and g =ro. Dividing f by g leads to the successive 
remainders r_j = g170 + 11 (Or1 $9Oro), ro = qari t+r2 (Ores dri), r1 =q3r2 +73 
(Or3 < Org), .:5 Ti = Gi42V it + 1142 (Ori42 S Orj41), etc. Since the degrees of 
the rj; decrease, a point must eventually be reached where the process stops, 
when rs42 = 0, so that the final equation is rs = q5+2rs41. Let d = rs41 since 
hcfs are unique up to constant factors. 

The induction hypothesis for proof of Theorem 3.1~--8 is that there exist 
polynomials a; and b; such that d = a;r; + bjrj4,. Using the above notation this 
is clearly true when i = 5 + 1 for then we may take a; = bj = 0. However, rj41 = 
Vi-1 — Gi+17j , and by induction d = ajr; + bj(r;~-1 — gj+17)) so that if aj; = bj and 
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bj-1 = aj — Djgi+7, then d = aj_1r;_1 + bj-17;. By descending induction d = a_yr_ 
1 + b_iro = af + bg where a = a_, and b = b_, thereby completing the proof. 

These concepts allow the development of a theory of the factorization of 
polynomials. A polynomial over a commutative ring is said to be reducible if it 
is a product of two polynomials of smaller degree. Otherwise it is irreducible. 
Thus all polynomials of degree 0 or 1 are irreducible since they certainly cannot 
be expressed as a product of polynomials of smaller degree. The polynomial 
x2 — 2 is irreducible over the rational numbers Q but can be reduced over the 
real numbers R by the equation 

x2 —2 = (x—V2)(x + ¥2) (3.1-8). 
This leads to the following theorems relating to the factorization of polynomials: 
Theorem 3.1-9: Any non-zero polynomial over a field K is a product of 
irreducible polynomials over K. 
Proof: Let g be any non-zero polynomial over K and use induction on the 
degree of g. If dg = 0 or 1 then g is automatically irreducible. If g 2 1, either g 
is irreducible or g = hj where dh and Oj < dg. By induction h and j are products 
of irreducible polynomials; therefore g is also such a product. The theorem 
follows by induction. 
Theorem 3.1-10: For any field K, the factorization of polynomials over K 
into irreducible polynomials is unique up to constant factors and the order in 
which the factors are written. 
Proof: Suppose that f=/1---f; = g1--gs where f is a polynomial over K and 
fi,.-.fr and 21,...,25 are irreducible polynomials over K. If all the f; are 
constant then fe K so that all of the g; are constant. Otherwise we may assume 
that no f; is constant by dividing out all the constant terms. Then f{lg1---gs and 
filg; for some i. Choose the notation so that i = 1, and then fjlg}. Since fj and 
g1 are irreducible and f] is not a constant, we must have f1 = k1g1 for some 
constant ky. Analogously fo = k2g2,....fr = krgr where k2,...,k, are constants. 
The remaining g; (j > r) must also be constant or else the degree of the right 
hand side would be too large. The theorem follows. 

Methods for testing the irreducibility of polynomials are very difficult 
just like methods for testing the primality of numbers. The following theorems 
are relevant to testing for the irreducibility of polynomials: 


The Symmetry of Equations 41 


Theorem 3.1-11: If f is a polynomial over the integers Z which is 
irreducible over Z, then f, considered as a polynomial over the rationals Q, is 
also irreducible over Q so that factorization over the integers Z is equivalent to 
factorization over the rationals Q. 

Proof: Suppose that fis irreducible over Z but reducible over Q so that f= gh 
where g and h are polynomials over Q of smaller degree. Multiplying through 
by the products of the denominators of the coefficients of g and h gives nf = 
g'h' where n eé Z and g' and h' are polynomials over Z. We now show that 
we can cancel out all of the prime factors of n one by one without going outside 
Z[x]. 

Suppose that p is a prime factor of n, and let g'= g9 + g1x +... + gyx! 
and h' = ho + hx + ... + hst’. Then either p divides all of the coefficients g; or 
else p divides all of the coefficients hj. If not, there must be smallest values i 
and j such that p 4g; and p 4h; . However, p divides the coefficient of x// in 
g'h' which is hogi+j + h1gisj-1 + ... Ajgi t+... + hj+jg0, and by the choice of i 
and j the prime p divides every term of this expression except perhaps hjgj. 
But p divides the whole expression so that plhjg;. This contradicts p Yg; and 
p4h;_ thereby proving the theorem.@™ 
Theorem 3.1-12 (Eisenstein’s Irreducibility Criterion): Let f(x) = ag + ax + 
... + a,x" be a polynomial over Z. Suppose that there is a prime q such that qg 

Kan, qiaj (0 <i < n—1), and g2dag. Then fis irreducible over Q. 
Proof: By Theorem 3.1—11 it suffices to show that f is irreducible over Z. 
Suppose for a contradiction f= gh where g = bo + bix +... + b,x", and h = cg 
+ CjX +... + esx) are polynomials of smaller degree over Z. Thenr+s=n. 
Now boco = ag so that glbg or gico. However, by assumption g cannot divide 
both bo and cg so that without loss of generality we can assume qlbg and q co. 
If all coefficients bj; were divisible by g, then a, would be divisible by q 
contrary to assumption. Let b; be the first coefficient of g not divisible by g so 
that aj = bjco + ... +bgcj where i<n. This implies that q divides co since g 
divides aj, bo,...,bj-1 but not b; . This leads to a contradiction thereby showing 
that fis irreducible.@ 

If f(x) is a polynomial over a commutative ring R, the zeros of f(x) 
correspond to the values of © e R where f(%) = 0. The following theorem 
relates the zeros of f(x) to its factors: 
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Theorem 3.1-13: If f(x) is a polynomial over the field K, then an element a 
é K is a zero of f(x) if and only if (v — @)If(). 
Proof: If (x — a)if(x) then f(x) = (« — @)g(x) for some polynomial g over K 
so that f(a.) = (a~—a)g(a) = 0. Conversely, suppose f(a) = 0. By Theorem 
3.1-6 there exist polynomials q and r over K such that f(x) = (x — a)g(x) + r@) 
where or <1. Thus r(x) =re K. Substituting o for x gives 0 = f(a) = 
(a— a)q(a) +r so that r = 0 and (x - &)If(x).@ 
If fis a polynomial over the field K, an element a € K is a Simple zero of f if 
(x — a)If(x) but & — a) f(x). Similarly the element a € K is a zero of f of 
multiplicity m if (¢ — 0)™If(x) but (x — a)" U(x). For example, the zero at o 
= 1 is a simple zero for x2 + x + 2 = (x — 1)(x + 2) but a zero of multiplicity 2 
for x3 + 3x —2 = (x — 1)2(x + 2). The following theorems relate to the 
multiplicity of zeros of polynomials: 
Theorem 3.1-14: If f is a non-zero polynomial over the field K with 
distinct zeros 041,...,%, With multiplicities m,...,m,, respectively, then fa) = 
(x — 04)1...(¢ — a,)’""g(x) where g(x) has no zeros in K. Conversely if f(x) = 
(x — 04)"1...(¢ — ay)!""g(x) where g(x) has no zeros in K, then the zeros of f 
in K are O1,...,0, with multiplicities m},...,my, respectively. 
Proof: For any a € K the polynomial x — is irreducible. Hence for any 
distinct o and B € K the polynomials x —a@ and x —f are coprime. The 
uniqueness of factorization (Theorem 3.1-10) leads to the relation f(x) = 
(x — 0)™1...(¢ — a,)"g(x), and g cannot have any zeros in K or else f would 
have extra zeros or zeros of larger multiplicity. The converse follows simply 
from the uniqueness of factorization (Theorem 3.1—10).@ 
Theorem 3.1-15: The number of zeros of a polynomial, counted according 
to multiplicity, is less than or equal to its degree. 
Proof: This follows from Theorem 3.1-14 where m, +... +m, Sof 

The original theory of Galois used polynomials over the complex field 
C. More recently in the 1920s and 1930s the methods of Galois theory were 
generalized to arbitrary fields. The central object of study then became the field 
extension related to a polynomial rather than the polynomial itself. Thus every 
polynomial f over a field K was used to generate another field L containing K 
(or a subfield isomorphic to K). In this connection, a field extension is defined 
as a map i:K—>L where K and L are fields; K is the small field and L is the large 
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field. The inclusion maps ij:Q-—>R, i2:R-C, and i3:Q-9C relating the 
rational, real, and complex numbers are all examples of field extensions. 

If i:K—L is a field extension, K can usually be identified with its image 
i(K) so that i can be considered an inclusion map and K a subfield of L. Under 
such circumstances the notation L:X can also be used for field extensions where 
L is an extension of K. If K is a field and X is a nonempty subset of K, then 
the subfield of K generated by X is the intersection of all subfields of K which 
contain X. 

Frequently in the case of a field extension L:K we are interested in fields 
lying between K and L. In such cases we are interested in subsets X of L 
containing K, i.e., sets of the form K U Y where Y CL. If L:K is an extension 
and Y is a subset of L, then the subfield of L generated by K U Y¥ is written 
K(Y) and said to be obtained from K by adjoining Y ¢ L. In general K(Y) is 
considerably larger than K UY. Thus the subfield R(i) of C must contain all 
elements x + iy where x,y € R so that C = R(i). Furthermore, the subfield of 
R consisting of all elements p + gV2 where p,g « Q is Q(V2). 

A field extension L:K is a simple extension if it has the property L = 
K(a) for some a € L; the extensions R(i) and Q(V2) are both examples of 
simple field extensions. In addition the field extension Q(V2,V3) may be 
reduced to the simple extension Q(V2 + V3) because of the relationships (V2 + 
V3)? = 5 + 2V6 and V6((V2 + V3) = 2V3 + 3V2 so that Q(¥2,V3) = Q(V2 + 
¥3). An isomorphism between two field extensions i:K—>K*, j:LL* is a pair 
(A,) of field isomorphisms A:K->L, p:K*—>L* such that for all ke K, 
JAK) = HG). 

Simple extensions can be considered to be algebraic or transcendental 
depending upon their relationship to polynomials. Let K(a):K be a simple 
extension. If there exists a non-zero polynomial p over K such that p(a) = 0, 
then @ is an algebraic element over K and the extension K(«):K is a simple 
algebraic extension. If there is no non-zero polynomial p of any finite degree a 
where p(a) = 0, then @ is transcendental over K, and K(a):K is a simple 
transcendental extension. The following theorem gives a method for 
constructing a simple transcendental extension of the field K: 

Theorem 3.1-16: The field of rational expressions K(x) is a simple 
transcendental extension of the field K. 
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Proof: The extension K(x) is clearly a simple extension. If f is a polynomial 
over K such that f(x) = 0, then f = 0 by definition of K(x).m 

If L:K is a field extension and @ € L is algebraic over K, then the 
minimum polynomial of © over K is defined to be the unique monic polynomial 
m over K of smallest degree such that m(a) = 0. The following theorems 
relating to minimum polynomials are relevant: 

Theorem 3.1-17: If @ is an algebraic element over the field K, then the 
minimum polynomial of « over K is irreducible over K and divides every 
polynomial of which @ is a zero. 

Proof: Suppose that the minimum polynomial m of o over K is reducible so 
that m = fg where f and g are of smaller degree. Assume that f and g are monic. 
Since m(a) = 0 by the definition of minimum polynomial, we have f(a)g(a) = 
0 so that either f(a) = 0 or g(%) = 0. However, this contradicts the definition of 
m as the minimum polynomial leading to the conclusion that m must be 
irreducible over K.@ 

Theorem 3.1-18: Let K(a):K be a simple algebraic extension where o has 
a minimum polynomial m over K. Then any element of K(a@) has a unique 
expression in the form p(a) where p is a polynomial over K and op < dm. 
Proof: Every element of K(a) can be expressed in the form f(a)/g(a) where 
f,@ © K{[x] and g(a) #0 since the set of all such elements is a field, contains K 
and a, and lies inside K(a). Since g(a) # 0, m does not divide g and since m 
is irreducible, m and g are coprime, i.e., their hcf is 1. By Theorem 3.1-8 
there exist polynomials a and b over K such that ag + bm = 1. Hence a(a)/g(a) 
= 1, s0 that fla)/g(a) = f(a)a(a) = h(a) for some polynomial hover K. Letr 
be the remainder on dividing h by m. Then r(a) = h(a). Since or < dm the 
existence of such an expression is proved. 

In order to show uniqueness suppose that f(a) = g(a) where df and dg 
<dm. If e =f-g then e(a) = 0 and de < dm. By definition of m we have e = 
0 so that f = g thereby proving the theorem.™ 
In addition, it can be proved! that if K is any field and m is any irreducible 
monic polynomial over K, then there exists an extension K(a):K such that a 
has minimum polynomial m over K. 

The degree of an field extension can be obtained by associating a vector 
space structure with the extension. Thus if L:K is a field extension, the 
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operations (A,u) ~—> Au (A € K,ue L) and (u,v) > u + v (u,v € L) define on 
L the structure of a vector space over K.!_ The degree [L:K] of a field 
extension L:K is the dimension of L considered as a vector space over K by this 
procedure. For example, the field C of complex numbers is two-dimensional 
over the real numbers R with {1,i} being a basis so that [C:R] = 2. It can be 
shown that if K,L, and M are fields and K GC Lc M, then [M:K] = 
[M:L][L:K]. The degree of a simple extension can easily be found by the 
following theorem!; 

Theorem 3.1-19; Let K(a):K be a simple extension. If it is transcendental, 
then [K(a):K] = o9, If it is algebraic, then [K(a):K] = 0m where m is the 
minimum polynomial of a over K. 

Proof: For the transcendental case it suffices to note that the elements 1, a, 
a2,... are linearly independent over K so that the corresponding vector space is 
infinite dimensional leading to infinite degree. For the algebraic case a basis is 
determined, Let dm =n and consider the n elements 1, a, ..., a"-!. By 
Theorem 3.1-18 these elements span K() over K, and by the uniqueness 
Clause of Theorem 3.1-18 they are linearly independent. Therefore they form a 
basis, and [K(a):K] =n = 0m. 

A finite extension is one whose degree is finite. Any simple algebraic 
extension is thus finite. However, the converse is not true. In this connection 
an extension L:K is algebraic if every element of L is algebraic over K. It can 
be shown! that L:K is a finite extension if and only if L is algebraic over K and 
there exist finitely many elements 0,...,a5€ L such that L = K(Q4,...,Q5). 


3.2 Galois Theory: Solubility of Algebraic Equations 
by Radicals 


Group theory was invented by Galois to study the permutations of the 
zeros of polynomials. Thus any polynomial f(x) has a group of permutations 
of its zeros, now called its Galois group, whose structure is closely related to 
the methods required for solving the corresponding polynomial equation f(x) = 
0. 

The problem of finding the Galois group of a given polynomial is 
Clearly an important one. Let K be a subfield of the field ZL. An automorphism 
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o. of L is defined to be a K-automorphism of L if a(k) = k for all k € K. This 
effectively makes @ an automorphism of the extension L:K rather than just the 
large field L. The set of all K-automorphisms of L in the field extension L:K is 
easily seen to form a group under composition; this group is called the Galois 
group Y(L:K). The role of Galois groups in solving polynomial equations 
relates to the observation originally by Galois that under certain extra 
hypotheses discussed later there is a one-to-one correspondence between 
subgroups of the Galois group of L:K and subfields M of L such that K ¢ M. 

A very simple illustration of a Galois group uses the extension C:R and 
the complex conjugation operation. If is an R-automorphism of C and j = 
a(i), then j2 = [a()]2 = a(i2) = a(-1) =-1 since a(r) =r for allre R. Thus 
either j =iorj=-—i. Nowa +iy)= a(x) + a(i)a(y) =x + jy for any x, y 
e R. This leads to two candidates for R-automorphisms, namely 04:x + iy 
—+x+iy (the identity operation) and Q2:x + iy —»x —iy (the complex 
conjugation operation). The maps 1 and 02 can be shown to be R-automor- 
phisms. Since 2? = 01, the Galois group T'(C:R) is the cyclic group of order 
2 (C2). 

If L:K is a field extension, a field M such that K ¢ M ¢ Lis called an 
intermediate field. A group M4 =T(L:M) of all M-automorphisms of L can be 
associated with each intermediate field M. Using this terminology, K® is the 
entire Galois group, and L® = 1, namely the identity map on L. Thus if the 
field M is a subfield of N, ic., M GN, then the group N* is a subgroup of 
M4, ie., N* CM4. Conversely, if each subgroup H of P(L:K) has an 
associated set Ht of all elements x € L such that a(x) =x for all a € H, the set 
Ht can be shown to be a subfield of L containing K.! The field H? is called the 
fixed field of the subgroup H. 

The concept of a splitting field can now be introduced which relates to 
the phenomenon that a polynomial with no zeros over one field may have zeros 
over a larger field. Thus the polynomial x2 +1 has no zeros over R but the 
zeros ti over C. Every polynomial can be resolved into a product of linear 
factors indicating its full complement of zeros if the original field is extended to 
a suitable splitting field. If K is a field and f is a polynomial over K, then f is 
said to split over K if f can be expressed as a product of linear factors f@ 
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k(x-4)...(4-Oy,) where k,&1,...,0, € K. The field > is defined as a splitting 
field for the polynomial f over the field K if the following conditions are met: 

@ Kc, 

(2)  f splits over 2, 

(3)  f splitting over Y' if K cy’ cD implies Y' = &. 
A splitting field can be constructed by adjoining, to the field K some additional 
elements which are zeros of f in accord with the following theorem: 
Theorem 3.2-1: If K is any field and f is any polynomial over K then there 
exists a splitting field for f over K. 
Proof: Induction is used on the degree Of. If df = 1 there is nothing to prove 
for f splits over K. If f does not split over K, then it has an irreducible factor 
f1 of degree > 1. In this case adjoin a to K where f(a) = 0. Then in K(Q)[7] 
we have f = (x — &)g where dg = Of — 1. By induction there is a splitting field 
> for g over K(a). But then > is a splitting field for f over K. 
Furthermore if i:K-—>K' is a field isomorphism, if T is a splitting field for f 
over K, and if 7" is a splitting field for i(f) over K', then the extensions T:K 
and 7°:K’ can be shown to be isomorphic. 

The concept of a splitting field can be illustrated by the equation 

fx) = @? -3)@3 +1) 
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The splitting field for this equation inside ‘ can es be shown to be 
Q(3,i). 


Normal field extensions are particularly important in Galois theory. In 


(3.2—1). 


this connection an algebraic field extension L:K is defined to be normal if every 
irreducible polynomial f over K which has at least one zero in L splits in L. 
For example, the algebraic field extension C:R is normal since every poly- 
nomial whether or not irreducible “pu in C. An example of an algebraic field 
extension which is ne ponnal is Q( 2) since the mance polynomial x3 — 2 
has a zero, namely 33 j in ay), but does not split in Q({2). A normal finite 
extension such as C:R has a well-behaved Galois group in the sense that the 
ae a correspondence is a bijection. However, the nonnormal extension 
ay): Q has a badly behaved Galois group. The following theorem indicates 
the close connection between normal extensions and splitting fields: 
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Theorem 3.2—2: An extension L:K is normal and finite if and only if L isa 
splitting field for some polynomial over K. 

Proof: If L:K is normal and finite, it can be expressed as L = K(Q1,...,Q5) 
for certain 0; algebraic over K. Let mj be the minimum polynomial of 0; over 
K and let f = my---ms. Each m; is irreducible over K and has a zero aje L so 
by normality each m; splits over L. Therefore f splits over L. Since L is 
generated by K and the zeros of f, it is a splitting field for f over K.@ 

The additional concept of separability is required for fields of non-zero 
characteristic. An irreducible polynomial f over a field K is separable over K if 
it has no multiple zeros in a splitting field. This means that in any splitting field 
f can be expressed as f(x) = k(x — 01)...(% — Gy) where the 6; are all different. 

With these concepts in mind the fundamental properties of the Galois 
correspondence can be established between a field extension and its Galois 
group. Let L:K be a finite separable normal algebraic field extension of degree 
n with Galois group G, which consists of all K-automorphisms of L, let ¥ be 
the set of intermediate fields M, and let G be the set of all subgroups H of G. 
Furthermore, if Me ¥, then M® is the group of all M-automorphisms of L, and 
if He G, Ht is the fixed field of H. The fundamental theorem of Galois theory 
then states the following: 

(1) The Galois group G has order n; 

(2) The maps * and 7 are mutual inverses and establish an order-reversing 
one-to-one correspondence between F and G; 

(3) If M is an intermediate field, then the degree of the extension L:M is 
equal to the order of the group M4, i.e., [L:M] = M41, and the degree of the 
extension M:K is equal to IGI/IM4|; 

(4) An intermediate field M is a normal extension of K if and only if M4 isa 
normal subgroup of G (Section 2.1); 

(5) If an intermediate field M is a normal extension of K, then the Galois 
group of M:K is isomorphic to the quotient group G/M. 

Details of the proof of this theorem are given elsewhere. 

The Galois correspondence expressed by the above theorem can be used 
to derive a condition which must be satisfied by an equation soluble by radicals, 
namely that its Galois group must be a soluble group (Section 2.1). In this 
connection an extension L:K is defined to be a radical extension if L = 
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K(01,...,0m) where for each i = 1,...,m, there exists an integer n(i) such that 
oj") e K(ay,...,0n-1). The elements 0; are called a radical sequence for L:K. 
Informally, a radical extension is obtained by a sequence of adjunctions of nth 
roots for various values of n. If f is a polynomial over a field K of 
characteristic zero and > is a splitting field of f over K, then f is said to be 
soluble by radicals if there exists a field M containing % such that M:K isa 
radical extension. The Galois group of f over K is the Galois group (2:4). 

The Galois group G of a polynomial f over the field K with a splitting 
field ¥ over K can be viewed as a group of permutations on the zeros of f. 
Thus if we ) is a zero of f, then f(a) = 0 so f[g(a)] = glf(a)] = 0 for any g 
€ G. In this way every element g € G induces a permutation &' of the set of 
zeros of f in > thereby defining an injection of G into the permutation group of 
the zeros of f. 

The irreducibility of a polynomial f over the field K (Section 3.1) can be 
related to the transitivity of its Galois group G. Consider a reducible 
polynomial f = gh where g and h are irreducible so that Of = 0(gh) = dg + Oh 
(see equation 3.1-7b). The roots of the algebraic equation f(x) = g(x)h(x) = 0 
consist of the union of the dg roots of g(x) and the dh roots of h(x) so that 
determining the roots of f(x) requires solving one equation of degree dg, 
namely g(x), and one equation of degree dh, namely A(x) rather than the more 
complicated task of solving a single equation of degree Of, namely f(x). This, 
of course, is how factoring an algebraic equation simplifies determination of its 
roots. The Galois group G of f(x) is an intransitive group with two orbits. One 
orbit corresponds to the dg zeros of g(x), and the other orbit corresponds to the 
dh zeros of h(x). In this way intransitive Galois groups can be seen to 
correspond to reducible polynomials and transitive Galois groups to irreducible 
polynomials. 

Methods for solving irreducible algebraic equations are thus seen by 
Galois theory to correspord to transitive permutation groups, 1.e., the Galois 
‘groups of the irreducible equations. The transitive permutation groups of 
degrees less than eight, and thus corresponding to irreducible equations of 
degrees lower than eight, are listed in Table 2—3. All of the transitive groups of 
degree 4 or lower are soluble groups so that all algebraic equations of degree 4 
or lower are soluble by radicals (Sections 5.1 and 5.2). The lowest degree 
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simple group is the degree 5 group As of order 5!/2 = 60, which means that the 
general quintic equation with As5 or Ss Galois group cannot be solved by 
radicals. However, quintic equations with the dihedral group Ds of order 10 or 
the metacyclic group Ms of order 20 can be solved by radicals since these 
groups are soluble (Section 5.3). Recent papers have given algorithms using 
radicals to determine the roots of such solvable quintic equations. 

The insolubility of quintic equations having the simple Galois group A5 
using only radicals does not mean that they cannot be solved using functions 
more complicated than radicals. Thus suppose that for any real number a the 
ultraradical Va is defined as the real zero of x5 +x-—a. Then Jerrard has 
shown that the general quintic equation can be solved by the use of such 
ultraradicals (Chapter 7),4 which turn out to be elliptic functions (Chapter 4). 
However, conversion of the general quintic equation to the so-called Bring- 
Jerrard equation x> +x —a requires a very difficult Tschirnhausen trans- 
formation (Section 3.3),> which is avoided in the alternative method of 
Kiepert® for solution of the general quintic equation (Chapter 6). 

Table 2~3 containing the transitive permutation groups of degree less 
than eight suggests other types of equations with simple Galois groups 
preventing them from being solved using only radicals. All of the alternating 
groups A, (n 2 5) are simple which means that the general equations of any 
degree greater than five cannot be solved using only radicals. The difficulties of 
solving such equations appear to increase factorially according to the n!/2 order 
of the alternating group Ay. A complete solution of the general sextic equation, 
including all of the necessary Tschirnhausen transformations (Section 3.3), 
does not appear to have been worked out. However, a special sextic equation, 
theoretically obtainable from the general sextic equation by Tschirnhausen 


2p. S. Dummit, Solving Solvable Quintics, Math. Comp., 57, 387 (1991). 
3S. Kobayashi and H. Nakagawa, Resolution of Solvable Quintic 
Equations, Math. Japonica, 37, 883 (1992). 

4A. Hausner, The Bring-Jerrard Equation and Weierstrass Elliptic 
Functions, Amer. Math. Monthly, 69, 193 (1962). 

SA. Cayley, On Tschirnhausen’s Transformation, Phil. Trans. Roy. 
Soc. London, 151, 561 (1861); Cayley’s Collected Works, paper 275. 


6, Kiepert, Auflésung der Gleichungen fiinften Grades, J. fiir Math., 
$7, 114 (1878). 
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transformations, has been solved using double (genus 2) theta functions 
(Section 8.1). In the case of the septic (degree 7) equation, simple groups arise 
not only for the general septic equation with A7 or S7 Galois groups but also for 
the special septic equation with the simple LG,2) Galois group of order 168 
(Table 2.1).7 In the 19th century Klein’ and then Radford? studied this special 
septic equation in some detail (Section 8.2). 


3.3. Tschirnhausen Transformations 


Consider the general monic algebraic equation of degree n, i.e., 
n~| 
x + alt ay = x" + Yan pi =0 (3.3-1). 
i=] 
A useful prelude to the solution of such an equation may involve substituting a 
new variable 
n—| . 
Ye = OQ + OAR +... + Appx! = Sax (3.3-2) 
j= 
where x, (1 $k <n) is a root of equation 3.3-1 to give a new monic algebraic 
equation of the same degree, i.e., 
n—-1 
yh + Aryl +... +An = yn t SY Ani =0 (3.3-—3). 
i=l 
Such a procedure is called a Tschirnhausen transformation'.11 and is 
beneficial when the coefficients a, (0 <m <n—1) in equation 3.3—2 are chosen 


so that equation 3.3-3 is simpler to solve than equation 3.3-1. After the 
transformed equation 3.3—3 is solved to give the roots yy (0S k <n), then the 


7H. Weber, Lehrbuch der Algebra, Vieweg, Braunschweig, 1898, 
Volume 2, §§ 141~—147, 

8F. Klein, Uber die Transformation siebenter Ordnung der 
elliptischen Funktionen, Math. Ann., 14, 428-471 (1879), Gesammelte 
Mathematische Abhandlungen, Volume 3, pp. 90~136, Springer 

_ Verlag, Berlin, 1923. 


9E. M. Radford, On the Solution of Certain Equations of the Seventh 
Degree, Quart. J. Math., 30, 263-306 (1898). 

100. Perron, Algebra, Third Edition, de Gruyter, Berlin, 1951, Chapter 
2, § 16. 


lIL. E. Dickson, Modern Algebraic Theories, Sanborn, Chicago, 1930, 
Chapter 12. 
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Tschirnhausen transformation is undone by solving equation 3.32 to give the 
roots xz (0 sk <n) as functions of yx. The value of Tschirnhausen 
transformations to facilitate the solution of algebraic equations thus relates not 
only to the ease of solving the transformed equation 3.3—3 but also to the degree 
of the equation 3.3—2 expressing yg in terms of Xz. 

The simplest example of a Tschirnhausen transformation is used in the 
solution of the monic cubic equation 


x3 + ayx2 + anx +03 =0 (3.3-4). 
A new variable y can be defined by the linear equation 

year+ F (3.3-5) 
which corresponds to equation 3.3-2 with 0 = aj/3, and a2 =0. This gives 
the monic cubic equation 

y? + 3A2y + A3 =0 (3.3-6) 


in which the y2 term vanishes, i.e., A] =0. A cubic equation with the y2 term 
missing can readily be solved by well-known methods using nested square and 
cube roots or by using trigonometric functions (Section 5.1). Solving the 
linear equation 3.3—5S to give x, as a function of yy is trivial and gives 

m= FB (3.3-7) 
Thus after the roots yg of equation 3.3-6 are found, the corresponding roots xx 
of equation 3.34 can be found using equation 3.3—7. 

The Tschirnhausen transformation is frequently useful for simplifying 
algebraic equations when the coefficients o% of equation 3.3—2 are so chosen 
that some of the coefficients Ag of equation 3.3-3 vanish. This can be done by 
using the Newton relationships between the sums of the roots of equations 
3.3-1 and 3.3—3 and their coefficients, 1.¢., 


yx, +a, =0 (3.3—8a) 
Sx? + ay dx + 2a2 = 0 (3.3~8b) 
— Ygd + a Dx? + adr + 303 =0 «.. (3.3-8c) 
for equation 3.3—1 and 
Typ + Ap =0 (3.3-9a) 
Ly2 + ArLyx + 2A2 = 0 (3.3-9b) 


Ly + Aly? + Ardy + 343 =0 ... (3.3-9c) 
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for equation 3.3-3. For example, if the objective is for the A,y”-! term in 
equation 3.3~3 to disappear (e.g., transformation of the general cubic equation 
3.34 to the special cubic equation 3.36), then Ly, = 0 by equation 3.3~9a. 
This can be achieved by a transformation 


Yk = Ay + Xf (3.3~10) 
where a; = 1. Summing equation 3.3-10 over k gives 
LYE = NA +Lxy (3.3-11) 


for an equation of degree n. Setting Sy, = 0 and using equation 3.3~8a gives 
OQ = a/n which gives equation 3.3—7 for the cubic equation where n = 3. 

In the solution of the general quintic equation a Tschirnhausen trans- 
formation is used with the objective of converting the general quintic equation 


+ ayx4 + a9x3 + agx2 + a4x + a5 =0 (3.3~12) 
to the so-called principal quintic equation 
y? + Azy2 + Agy + A5=0 (3.3—13) 


with the objective of making the y4 and y3 terms disappear, i.e., A] = A7 = 0. 
This requires a transformation 

Vk = OQ + AXE + OX, (3.3-14) 
Squaring equation 3.3—14 gives 

Yer = 0% + ZEA x, + 42x42 + ayer? + 2007x,3 + 0tn2x,4 

(3.3-15) 

Summing equations 3.3~14 and 3.3-15 over all of the roots and setting Diy, = 
Dyk” = 0 to make Ay = A? = 0 gives 


Ly~ = NA + yxy + Yx,2a2 = 0 (3.3-16) 
Dy = nag? + 2UxpAQay+ DVrxzp2042 + 2>:x470902 + 221x430 a2 + 
Lxy4a? = 0 (3.3-17) 


Solving these simultaneous equations leads to a quadratic equation for the ratio 
01:02 which thus can be solved by adjoining a square root. Thus a 
Tschirnhausen transformation to make Aq = A2 = 0 is therefore possible and 
requires only adjoining one square root. 

Now consider a Tschirnhausen transformation of equation 3.3~1 to 
3.3-3 where the y”-!, yn-2, and y”-3 terms are made to vanish by setting Aj = 
A2 = A3 = 0. In the case of the general quintic equation (equation 3.3—12), this 
leads to the Bring-Jerrard equation 

y> —Agy + As =0 (3.3—18). 
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Such a Tschirnhausen transformation uses a relationship 


Ve = OQ + Cy Xp + Ox? + 03x43 + oax (3.3-19). 
The requirement A; =0 => Lyx = 0 leads to the condition 
nO + Oy Dxg+ OQDx2 + 03LxP3 + 4dr 4=0 (3.3-20). 


The additional requirements Az = A3 =0 > Yy,2 = Lye = 0 lead to two 


homogeneous equations of degrees 2 and 3, namely 


4 4 

SY Vat HYagop = (3.3-21a) 
A=0 p=0 

a oe 

pg a SY (lt Vag ody = 0 (3.3-21b) 
A=0 y=0 v=0 


Solving equation 3.3-20 for ao in terms of Om (m = 1,2,3,4) and substituting 


this value in equations 3.3—21 gives the following equations: 


4 4 

SBapaaap, =0 (3.3-22a) 
A=1 p= 1 
4 4 4 
YY VMCapvoaapor = 0 (3.3-22b) 
A=1 p=1 v=0 


It can be shown that equation 3.3—-22a can be satisfied by adjoining two square 
roots. Similarly, equation 3.3-22b can be shown to require a cubic equation to 
determine to determine some of the @, values. This Tschirnhausen transforma- 
tion thus can become very difficult to carry out in practice. Cayley> gives this 
Tschirnhausen transformation in detail for converting the general quintic 
(3.3-12) to the Bring-Jerrard quintic (3.3-18), but it is very complicated and 
tedious. 

The original method for solution of the general quintic equation using 
elliptic functions reported by Hermite in 1858 required this very complicated 
Tschirnhausen transformation for converting the general quintic (3.3-12) to the 
Bring-Jerrard quintic (3.3-18), which could readily be solved using elliptic 
functions (Chapter 6). However, an easier procedure for solving the general 
quintic equation uses the much easier Tschirnhausen transformation of the 
general quintic (3.3—12) to the principal quintic (3.3-13) in which only Aj and 
Ag are zero, Then a second Tschirnhausen transformation of a different type is 
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used which is derived from the special symmetry of the icosahedron!2 as 
described in Section 6.3. This Tschirnhausen transformation converts the 
principal quintic (3.3-13), namely an example of an equation (3.3—-1) where 
a\= a2 = 0, to an equation of the following type known as the Brioschi quintic: 

z) ~ 10Zz3 + 45Z2z —Z2 =0 (3.323) 
The Brioschi quintic is not only an example of an equation of the type 3.3~3 
where A = A3 = 0 but also has coefficients which can be described by a single 
parameter conventionally designated as Z. The analogue of equation 3.3—2 
relating the ygs of the principal quintic equation 3.3—-13 in y to the z,z’s of the 
Brioschi quintic equation 3.3—23 in z can be derived by solving the following 
equation for zg: 

A+ Uz, 

** @AIZ) -3 
Note that solving equation 3.3-24 for zg in terms of yg does not give a 
polynomial for yg in terms of zg. Thus the transformation of the principal 
quintic (3.313) to the Brioschi quintic (3.3-23) is a more general type of 
Tschirnhausen transformation than those defined by equations 3.31 to 3.3-3 
where the analogue of equation 3.3—2 is not a simple polynomial. 


(3.324) 


12R, B. King and E. R. Canfield, Icosahedral Symmetry and the Quintic 
Equation, Comput. and Math. with Appl.,24, 13-28 (1992). 


Chapter 4 
Elliptic Functions 
4.1 Elliptic Functions by the Generalization of Radicals 


The previous chapter discusses the application of group theory to the 
symmetry inherent in algebraic equations with the idea of relating the Galois 
group of an equation to the methods required for its solution. Thus a 
consequence of the solubility of all permutation groups of degrees 2, 3, 4 is the 
fact that radicals of the general type Va (n = 2 and/or 3) are sufficient for 
solving any quadratic, cubic, or quartic algebraic equation (i.e., equations of 
degrees 2, 3, and 4, respectively). These radicals Na can be evaluated using 
logarithms as follows where In a = logea: 


Ya sain = ill, In a)= exp a In a) (4.1-1) 
- The natural logarithm required for equation 4.1—1 arises from the following 


indefinite integral: 


Inx= [Ss (= (4.1-2) 


Thus the logarithm is an example of a transcendental function that can be 
expressed in the general form 


dx 


V2) 


FO) = (4.1-3) 


in which P(x) is a polynomial. In the case of the natural logarithm the 
polynomial A(x) is the degree 2 polynomial x, 

In some cases, notably the solution of the general cubic equation with 
three real roots (Section 5.1), it is more convenient to substitute trigonometric 
functions for radicals, since the necessary radicals would require the difficult 
calculation of real numbers by taking cube roots of complex numbers. These 
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trigonometric functions can be generated by inversion of integrals of the type 
4.1-3 where Pis also a degree 2 polynomial. Thus consider the expression 


‘x + DL Xx: b 
= [ee + [dd 
=~ Var—x2 + Z Bie Va2—x2 + bu (4.1-4) 
at 


a*—x 
However 
Bs u = sin-lx/a (4.1-5). 
a) g2x2 


Inversion of equation 4.1-5 gives x =a sin u. Setting a = 1 gives 


dates (So (4.1-6) 
Vi-x2 | 


The above analysis shows how logarithms and trigonometric functions, 
which are sufficient to solve all quadratic, cubic, and quartic equations, can be 
derived from integrals of the type 4.1-3 in which P(x) is a second degree 
polynomial. However, radicals, and thus logarithms and trigonometric func- 
tions, are no longer sufficient for solution of the general quintic equation, i.e., a 
general equation of degree 5, since the corresponding Galois group of the 
general quintic equation is the alternating group As, which is a simple group. 
In order to solve the general quintic equation the concept of radicals must be 
generalized to what have been called ultraradicals, i.e., Va for the real zero of 
x3 +x —a.1! Such ultraradicals for the general quintic equation turn out to be 
elliptic functions, which can be derived from integrals? of the type 4.1-3 in 
which the polynomial P(x) is of degree 3 or 4 analogous to the derivation of 
logarithms and trigonometric functions from such integrals (4.1—3) in which the 
polynomial A(x) is of degree 2. Thus consider the following expression which 


11, Stewart, Galois Theory, Chapman and Hall, London, 1973. 


2H. E. Rauch and A. Lebowitz, Elliptic Functions, Theta Functions, and 
Riemann Surfaces, Williams and Wilkins, Baltimore, MD, 1973. 
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is the cubic analogue to equation 4.1—4: 
x2 +b 
em LA ee 
[Feremn 
(x2 + a/3) (b — a/3) 
Je rer xs+axt1 
=o Stax +1 + | oma sais} (4.17) 


xe+ax+1 


Equation 4.1-7 for b —a/3 ¥ 0 gives an integral whose evaluation for 
a> # -27/4 in terms of familiar elementary functions (algebraic, logarithmic, 
exponential, and trigonometric) was unsuccessfully sought for more than a 
century after Newton’s and Leibniz’s basic discoveries of calculus By the end 
of the 18th century the leading mathematicians such as Euler, Lagrange, and 
Legendre were rather sure that integrals of the type 4.1-3 where P(x) is a degree 
3 polynomial represented a new type of transcendental function. Such 
integrals, even before they were understood, were called elliptic integrals since 
related integrals arise in the determination of the arc length of an ellipse. 

The original work on elliptic functions was done in the late 1820s by 
Abel and Jacobi using an integral of the type 4.1-3 where Px) is the fourth 
degree polynomial 

Ax) = (1-x2)(1-k2x2) (4.1-8). 
This fourth degree polynomial for P(x) was chosen because of its resemblance 
to the second degree polynomial A(x) = 1 — x? found in the integral expression 
for sin~!x (equation 4.1-6). This also relates to extension of the single 
periodicity of the trigonometric functions to the double periodicity of the elliptic 
functions with the modulus k being the ratio between the two periods of the 
elliptic functions (Section 4.2). 

The first elliptic functions to be defined used the following definite 
integral to define a new function f(u): 


x=f(u) 
a dx 
IF (1—x2)(1-k2x2) 
=0 


(4.1-9) 
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Jacobi made the trigonometric substitution x = sin ¢ so that 


__cos 0d0- > dé (4.1-10) 
~ | af cos26C-k2sin20) J 1—k2sin20 


The variable ¢ was then called the amplitude of u so that x = f(u) = sin @= sin 
amplitude u abbreviated first by Jacobi to sin am u and subsequently by Guder- 
mann to sn u, which is the most commonly used notation today. The other 
Jacobi elliptic functions are defined as follows: 


cnu=cos @= V1 — sn2u (4.1-11a) 
dnu=V1 ~ k2sn2u (4.1—11b) 


The explicit dependence of sn u on the modulus k can be introduced by writing 
sn(u,k). The complementary modulus k' is defined by k' = V1 — k2. Note 
that sn (0,k) = 1 for all k. 

The Jacobi elliptic functions sn u, cn u, and dn u defined by equations 
4.1-10 and 4.1—11 use an integral of the type 4.1-3 where P(x) is a quartic 
polynomial. It is now instructive to relate this type of integral to similar 
integrals where A(x) is only a cubic polynomial. Consider the use of an elliptic 
integral to define an elliptic curve by first defining a general quartic polynomial 
by the equation 

w2 = Pz) = a4z4 + a323 + a2z2 + ajz + ag 

= a4(z ~ 21)(Z — 22)(z — 23)(z — 24) | (4.1—12) 
in which ag,...,@4 are arbitrary complex constants and z1,...,24 are the roots 
(assumed distinct) of the polynomial in equation 4.1-12. If z is a complex 


_ variable, then equation 4.1~12 defines w as a two-valued algebraic function of 


z. Abel then defined an elliptic integral to be an integral of the form JR(z,w)dz 
in which R(z,w) is a rational function of the complex variables z and w. The 
locus of all values of z and w satisfying equation 4.1-12 can appear as a two- 
dimensional locus in the four-dimensional space of the two complex variables z 
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and w. Equation 4.1-12 may be regarded as defining an algebraic curve so that 
a function of the form R(z,w) is then a function on this curve. Because 
integrals of the type 4.1-3 where A(x) is a cubic or quartic polynomial are called 
elliptic integrals, this curve is called an elliptic curve. 

The most general elliptic curve defined by equation 4.1-12 would 
appear to depend on four parameters, namely zz (1 sk <4). However, for the 
purposes of integration only one complex parameter is required as can be seen 
by putting equation 4.1—12 into a standard form, called the normal form. In 
order to put equation 4.1—12 into the normal form, first define 


1 _ (2 = 24)(23 — 22) 
= (= 29) 23 — 24) More 


Applying equation 4.1—13 to z' = z1 gives 


1 (z1 — 24)(z3 — 22) 
4 (zi — 22)(23 — 24) ee 


where A 1s called the cross-ratio of z1,...,24. Similar applications of equation 
4.1-13 take z9 to ©9, z3 to 1, and zq4 to 0. Now solve for z as a function of z’ 
and substitute z' for z in the right-hand side of equation 4.1~12 to give 


= 4 
wf 1 i Gey = ag'z'(1 — z')(1 — Az’) (4.1-15) 
be (pee AO eco ee 2 

where a4’ = (22 — 24)2(z3 — 24) E = 2 fi (4.1-16) 
Now define 

w'=w E _ 7{23—~ 44 | ‘iy (4.1-17) 

23-22 Vag! 

Then 

w'2 = a4'z'(1 — z')(1 — Az’) (4.1-18a) 
and = w"2 = 4z'(1 —z')(1 — Az’) (4.1-18b) 


Dropping all primes gives Riemann’ s normal form of the elliptic curve. Thus if 
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an elliptic curve is defined by (4.1-12), its normal form can be written by 
computing the cross-ratio of the four roots of the polynomial. If the arbitrary 
order in which the roots are taken is changed, 2 will be replaced by 7 


. Note also that in Riemann’s normal form 


> 


Pe ae ed 
(4.1-18) an elliptic curve in (4.1~12) is defined by a cubic equation rather than 
the original quartic equation. Furthermore, suppose that a4 in (4.1—12a) is zero 
so that the resulting quartic equation becomes a cubic equation with three 
distinct roots. This still defines an elliptic curve, and the transformation to 
Riemann’s normal form is nearly trivial since there is no need to send one root 
to co by use of the transformation in (4.1-13). Thus if the roots of the cubic 


equation are z, 22, and z3, then the corresponding linear transformation z' = 
Z—-Zq 
22-21 


leaves oo fixed and has the following additional effects: 


z3 70,2231 ee 
? ? A 22-2] 


(4.1-19) 
In this way the cubic can be viewed as a degenerate quartic with one zero at ©, 
The two branches of w arising from the square root in solving equation 4.1-12 
for w coalesce at three zeros and one infinity in the cubic as a degenerate quartic 
rather than at four zeros in the case of a true quartic. This shows that cubic and 
quartic polynomials as P(x) in equation 4.1-3 give the same kind of transcen- 
dental functions, namely the elliptic functions. Any other degenerate form of 
the quartic equation on the right side of equation 4.1—12, such as a double root 
or reduction to a quadratic or linear polynomial, leads back to more elementary 
functions, namely algebraic, logarithmic, exponential, and/or trigonometric 
functions. 

The differential equations derived from Riemann’s normal forms 
(equations 4.1—18) lead to another integral of the type 4.1—-3 analogous to 
equation 4,1—9 but where #(x) is now a cubic polynomial, namely 


sie sites (4.1-20) 
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The function f(u) defined by inversion of this integral is called the Weierstrass 
fo-function and is conventionally designated as go (u) or more precisely 
(2 (322,23) to emphasize the role played by the parameters g2 and g3. The 
function 42(u) tums out to be single-valued and analytic except for an infinity of 
poles of second order. Since the inversion of the general integral 4.1—20 is a 
formidable problem, an alternative approach to the Weierstrass elliptic functions 
based on a study of doubly periodic functions discussed below is more useful. 


4.2 Elliptic Functions as Doubly Periodic Functions 


Let f(z) be a single-valued function which is analytic on the complex 
plane z =x + iy except for isolated singularities such as poles, which are 
defined as points where f(z ) = 0. A period, p, of f(z) is a complex number 
such that 

F(z) = f(z +p) (4.2-1) 
for all z for which f(z) is analytic. A function which has one (non-zero) period. 
has an infinity of periods, e.g., mp for all integers n. Let Q be the set of all 
points in the Argand (complex) plane, z = x + iy, corresponding to periods of a 
function f(z). Periodic functions can be of the following types: 

(1) The trivial case where f(z) is a constant so that Q is the whole complex 
plane; 

(2) The case where f(z) is a single periodic function, such as the 
trigonometric functions sin z, cos z, tan z, etc., so that QO is a system of 
equidistant points on a straight line through the origin of the complex plane; 

(3) The case where f(z) is a doubly periodic function so that Q is a point 
lattice formed by intersections of two families of equidistant parallel lines. 

The point lattice of a doubly periodic function f(z) in the Argand 
(complex) plane may also be generated by the repetition of congruent parallelo- 
grams. Consider one such parallelogram with one of its vertices at 0 and let the 
other three vertices be 2, 2@', and 2m + 2@'. Then 2m and 2’ are a pair of 
primitive periods of f(z), and all periods of f(z) have the following form where 
m and n are integers: 

20mn = 2mMO + 2nw (4.2~2) 
Clearly the ratio @'/@ is not real. The primitive periods are conventionally 
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chosen such that Im(@'/@) > 0 where Im z refers to the imaginary part of the 
complex number z. 

Two points of the complex plane corresponding to the variable z of a 
doubly periodic function are said to be congruent if they differ by a period. A 
connected set of points is called a fundamental region if every point of the plane 
is congruent to exactly one point of the set. The fundamental region is chosen 
to be a parallelogram with two sides and the vertex at which they intersect being 


- considered as part of the parallelogram and the other two sides and three 


vertices not forming part of the parallelograms. If zo is fixed the points 

z=29 + 2€@+ 2a (4.2—3) 
where 0 < &,7) < 1 form the fundamental period parallelogram. Any parallelo- 
gram obtained from the fundamental period parallelogram by translation by an 
integral number of periods is a period parallelogram, or a mesh, which 
corresponds to a set of points 

z=zyt+2(m+ Got n+ (4.2-4) 
with a fixed pair of integers {m,n}. Figure 4-1 shows two different ways of 
choosing period parallelograms for a given period lattice Q. Since a doubly- 
periodic function assumes the same value at congruent points, it is sufficient to 
describe the behavior of such a function in any single mesh. Furthermore, 
since f(z) has only isolated singularities and isolated zeros, the fundamental 
period parallelogram (i.e., z@ in equation 4.2—3) can be chosen so that no 
singularities or zeros of f(z) lie on the boundary of such a mesh. A mesh with 
these properties can be called a cell. 

An elliptic function is defined as a single-valued doubly-periodic 
analytic function whose only possible singularities in the finite part of the plane 
are poles. Such elliptic functions have the following properties:3 
(1) Every nonconstant elliptic function has poles. Otherwise, if f(z) has no 
poles in a mesh, it is bounded there and hence in the entire complex plane so 
that it must be a constant by Liouville’s theorem. 

(2) An elliptic function has only finite numbers of poles and zeros in any mesh, 
if it does not vanish identically. Infinities of poles or zeros in a mesh imply the 


3A. W. Erdélyi, W. Magnus, F. Oberhettinger, and F. C. Tricomi, Higher 
Transcendental Functions, McGraw-Hill, New York, 1953, Chapter 13, 


part 2. 
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presence of singularities other than poles. The number of poles in a cell, each 
pole counted according to its multiplicity, is called the order of an elliptic 
function. The set of poles and zeros in a given cell is called an irreducible set. 
(3) The sum of the residues of an elliptic function at its poles in any cell is zero. 
If C is the boundary of the cell, the sum of the residues te f(2dz is zero since 
the integrals along opposite sides cancel. 

(4) There is no elliptic function of order one. Such a function would have 
exactly one simple pole in each cell which is inconsistent with a residue sum of 
zero (property 3 above). 

(5) An elliptic function of order r assumes every value exactly r times in any 
mesh counting multiplicity. 

(6) The sum of an irreducible set of zeros is congruent to the sum of an 
irreducible set of poles with each zero and pole being repeated according to its 
multiplicity. 

(7) Two elliptic functions, which have the same periods, the same poles, and 
the same principal parts at each pole differ by a constant. 

(8) The quotient of two elliptic functions whose periods, poles, and zeros are 
the same (including the multiplicities of the poles and zeros) is a constant. 


Figure 4-1: Two different ways of choosing period parallelograms for a given point-lattice 
Q. in the complex (Argand) plane. 


Elliptic Functions 65 


All elliptic functions with the same periods 2@ and 2! form a field K 
(Section 3.1). Thus any rational function (with constant coefficients) of elliptic 
functions with the same periods belongs to K. Moreover, the derivative of any 
function in K belongs also to K so that K is a differential field. However, an 
integral of a function of K does not necessarily belong to K. Although 
{2@,2@'} is a pair of primitive periods for some functions in K and a pair of 
periods for all functions in K, it is not necessarily a pair of primitive periods for 
all functions in K. Any two elliptic functions f and g of the field K can be 
shown to be connected by an algebraic equation Pf,g) = 0 where f(x,y) is a 
polynomial with constant coefficients and the algebraic curve (x,y) is 
unicursal. In addition, any elliptic function f satisfies an algebraic differential 
equation of the first order Pf’) = 0 where (x,y) again is a polynomial with 
constant coefficients and genus zero. Any elliptic function, f(z), can be shown 
to satisfy an algebraic addition theorem A[f(u),f(v),f(ut+v)] = 0 where 
A(x,y,Zz) is a polynomial whose coefficients are independent of u and v. 
Conversely it can be shown that a single-valued analytic function of z which 
satisfies an algebraic addition theorem of the above form is either a rational 
function of z, a rational function of e4 for some A, or else an elliptic function. 

The simplest nontrivial elliptic functions are functions of order two. 
These may be of either of the following two types: 

(1) One double pole with residue zero in each cell leading to the Weierstrass 
type of elliptic functions discussed below; 

(2) Two simple poles with residues equal in magnitude but opposite in sign 
in each cell leading to the Jacobi elliptic functions discussed above. 

These considerations lead to the definition of the Weierstrass elliptic 
functions in terms of their periodicity rather than by inversion of the integral in 
(4.1-20). The Weierstrass function 9(z) = g(zlw,q’) is thus uniquely defined 
as an elliptic function of order two with periods 2@, 2@' and a double pole at z 
= 0 with principal part 1/z2 around which ga(z) — 1/z2 is analytic and vanishes 
atz=(Q. An analytical expression for @(z) can be obtained by constructing a 


meromorphic function with double poles and principal parts nbs at all 


(z - w) 
points w= Wy,. The partial fraction expansion of such a function can be 
obtained by the following equation: 
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1 > 1 1 
f@=35+ — - (4.2-5) 
zt aay g| CoP "| 


Moreover f(z) 5 vanishes at z= 0. By rearranging the series it can be proved 


that f(z + 2@) = f(z) = f(z + 2w’) leading to the conclusion that f(z) = (z) or 


p= ptzioa)=+ — 6 - nl 
z (n De, @2mainal? (2m@+2nw')2 


(4.2-6) 
The function s9(z) is an even function of z. 
The derivatives and integrals of the Weierstrass function ga (z) are of 
interest.3 Differentiating s0(z) gives the following equation: 


1 
2 by ra (4.2-7) 


all (m,n 


2 1 
§a'(z)=--3- 2 Sy a Se 
a ert 


Integrating ga (z) term by term gives the Weierstrass zeta function, which is a 


meromorphic function with simple poles, i.e., 


ae 1 
C(z) = C(z lo, @') = = + Sy [ew + i + | (4.2-8a) 
(m,n}#(0,0 


(z) =<) (4.2-8b) 


Note that the function C(z) is an odd function of z (ie., C(z) = —C(-z)) and is 
not doubly-periodic and therefore not an elliptic function. The variables 7 and 
7] are used to express the lack of periodicity of C(z) as follows: 


C(z + 2@) = C(z) + 27 (4.2~9a) 

C(z + 20) = C(z) + 277' (4.2—-9b) 
Integrating C(z) around a cell gives Legendre’ s relation, namely 

no — no = 1/2 in (4.2-10) 


The Weierstrass sigma function is an entire function whose logarithmic 
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derivative is the zeta function, i.e., 
; | z z rz 
0(z) = o(zl,@,@') = z (: _ 77) exp| = + Aw } (4.2—11a) 
(m,n)A(0,0 


C(z) 20) (4.2~11b) 
o(z) 


= —d2[In 6(z)] 7 o'2(z) —a(z)o"(z) 


§9 (z) dz2 022) (4.2—1 Ic) 


Using the abbreviations 


go = 60 > + and g3 = 140 »y 4 (4.2-12) 


(mnyrO, (mnye0,3 
the power series expansion of o(z) and the Laurent series expansions of C(z), 
§2(z) and ”9(z) in the neighborhood of the origin are as follows: 


n/a «LE sc SARS 9) 5 L - 
O(z) =z 24.3.5 ~ 23.3.5.7  29.32.5.7  27.32.52-7.11 °°" (4.2-13a) 


et re 7 Ys So 9 LO -13b 
G@) = 2 * 22.3.5 22.5-7  24.3-52-7 hs os 
Pe ee Lae sea 9c x 
Q@=—5 * 92.5 +927 ary Oy Naa eee) 
jane = Gee a2 pene 4: (4.2—13d) 
99 3 25 7 93.52 °°" ’ 


Formulas with the Weierstrass elliptic functions can be expressed more 
symmetrically using the notation @1 = @, @2 =—@-— @', and @3 = w' and 
Ta = €(@g) (a = 1,2,3) leading to the following equations (a = 1,2,3): 

C(z + 2a) = €(z) +279 (4.2-14a) 

O(z + 2@,) = —O(z)exp[2Ng(z+@q)] (4.2—14b) 
The following three functions can then be defined for a = 1,2,3: 
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O(z + Wa) 


O(@y) 


Oa(z) = exp(-Z7]a) (4.2-15) 


The function g'(z) is an odd elliptic function of order three with 
periods 2@, (a = 1,2,3) with three zeros in every cell. Now (a'(-@g) = §2'(@a) 
since 40' has a period 2@g and ¢9'(—@q) = —f2'(@q) since ga’ is an odd 
function of z. Therefore z = @g (a = 1,2,3) is seen to be an irreducible set of 
zeros for g'(z) which are customarily designated as 

€q= §(@q) for a= 1,2,3 (4.2-16) 
The function (9 (z) — 49(@g) is an elliptic function of order two with double 
poles at points congruent to 0 and double zeros at point congruent to @g. Since 
it is of order two, these are the only poles and zeros so that the function 
V f2(z)—@q_ can be defined as a single-valued function but not necessarily 
with periods 2@ and 2a’. 

The functions g9'2(z) and [ 9(z) — e1][ @ (Zz) — e2][ (Zz) — 3] are both 
elliptic functions of order six with the same periods (i.€., 20 for a = 1, 2, 3), 
the same irreducible set of double zeros at Mg, (& = 1, 2, 3), and a pole of order 
six at 0 so that their quotient is constant. Using equations 4.2~6 and 4.2~7 an 
algebraic differential equation for the Weierstrass go function can be obtained, 
1.03; 

§o'2(z) = 4[ (z) — er @@) — er] O@ - 23] (4.2-17) 
An alternative form of this differential equation can be obtained from the 
observation that 9'2(z) —4 03(z) + 22 9(z) is an elliptic function of no more 
than order six and that all possible poles of this function are congruent to 0. 
Equations 4.2—13c and 4.2—13d indicate that this function is regular at z = 0 and 
therefore a constant. Furthermore, equations 4.2-13c and 4.2—13d indicate the 
value of this constant to be -g3 leading to the following alternative differential 
equation for go’: 

g'2(z) = 4 03(z) ~ g2E(z) — 83 (4.2-18) 

A comparison of the right sides of equations 4.2-17 and 4.2-18 shows that é1, 
e2, and e3 are the three roots of the algebraic equation 4z3 — g9z — 93 = 0 leading 
to the following relationships between these roots using formulas for the sums 
of the roots of algebraic equations (compare equations 3.3-8 and 3.3-9): 
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€] + e2 + e3=0 (4.2-19a) 
—4(e2¢3 + €3¢1 + €1e2) =0 (4.2-19b) 
4e1€2€3 = 3 (4.2-19¢) 
et? +692 +032 = Ip 99 (4.2-19d) 
e13 + e23 + €33 = 3/4 93 (4.2-19¢) 
e14 + e74 + 634 = Vg p92 (4.2-19f) 
16 (e2 — €3)*(e3 — e1)?(e1 — €2)? = g23 — 27932 =A (4.2-19g) 


The quantity A is the discriminant of the cubic equation in go(z) on the right side 
of (4.2-18). 

The Weierstrass elliptic functions can be related to the Jacobi elliptic 
functions discussed in Section 4.1.3 Thus consider the Jacobi elliptic function 

w = sn u = sn(u,k) (4.2~—20) 
defined as in equation 4.1-9. The variables u and k of this Jacobi elliptic 
function bear the following relationships to the variable z and the parameters ex 
(k = 1,2,3) discussed above for the Weierstrass elliptic functions: 


€1 2 €2: €3 = (2-k2) : (2K2~-1) : (1 +k?) (4.2-21a) 
z= Es = u=zVe;—e3 (4.2-21b) 
k2 rae (4.2-21¢) 
k2= 1-12 oe (4.2-21d) 


The Jacobi elliptic functions can then be expressed as follows using the 
Weierstrass functions: 


ghGi se ae eg (4.2-22a) 


V 9 (2) -¢3 o3(2) 


supe oe a mH) 
Vee, %3@ 


(4.2~22b) 
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Ve@()-e2 _ 922) 
dn(u,k) = 2 
Vp @—e 3 


The Weierstrass elliptic functions exhibit a number of addition theorems 


(4.2~22c) 


of interest in the context of algebraic equations.3 The addition theorem of the 
go-function may be written in several forms, i.e., 


meme 5 ty 2 
pur) =4 , [eee ~ pu) - E) (4.2-23a) 


1 gw §2'(u) 
1 gp) ga'(v) = 0 (4.2—23b) 
1 g(utv) -—g@'(uty) 


res vn ee) 

gtuty) = elu) sa PTS 
_ om i 2 {| 2iwm=2' : 
OO ~Zav eo oe ae 


2 
flue) + pu») =20W-Slinlew- ew) 42-234) 


These addition theorems may be proved by observing that the functions on the 
two sides of the equation are elliptic functions with the same periods, poles, and 
principal parts, and have the same value at some specified point. From these 
addition formulas for §9(u) the following related formulas can be derived: 
Li] 2 
Q22)=-2 p+ Fea (4.2~24a) 
2 §2'(z) 
eChz2) = o@)+V e(2)-e2 V Ml) -e3 + 
V @(z)-e3 V @(2)-e1 + 
Ve) -e1 V p()-e2 (4.2-24b) 
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In equation 4.2~24b the square roots are taken so that z = 0 is a simple pole 
with residue unity thereby implying 


p')=2Ve@)-aVe@-ave@-es (4.2-25) 


Some related expressions can be derived for the Weierstrass zeta and 
sigma functions, e.g., 


1 g'(u) — e'() 
= = (4.2-26a) 
Cuty) = C(u) + Cv) +5 Siu 
o(u+v)o(u-v) = — 02(u)o2(v)[ E(u) -— EV)] (4.2-26b) 


Equations 4.2—26 are not true addition theorems since the right hand side 
contains functions other than the function on the left hand side. The following 
equations can be derived from equations 4.2—26: 


C(z + 2mm + 2ng') = C(z) + 2m + 2nN! (4.2-27a) 


O(z + 2m@ + 2nw') = (-1)™*"*m GO(z) exp[(ztma@tnw')(2mn+2n7) 
(4.2-27b) 
In these equations m and n are integers and 7) and 77 are defined as in (4.2-9). 
(a '(u) - p'(v) 
f2 (u) - e(v) 


functions and §2(u) — (v) in terms of sigma functions. 


These equations are derived by expressing in terms of zeta 


Another function of interest is the quotient o(nu)/o"(u), since the case 
where n = 5 (i.e., 6(5u)/025(u)) is used in the solution of the quintic equation. 
A useful formula for this function attributed to Kiepert is given by Schwarz ina 
set of notes originating from Weierstrass and published in 1893.4 This set of 
old notes is a useful source for other relatively obscure information on elliptic 
functions, particularly those of the Weierstrass type. The general formula for 


4H. A. Schwarz and K. Weierstrass, Formeln und Lehrsdtze zum 
Gebrauche der Elliptischen Functionen, Springer, Berlin, 1893, p. 19. 


72 Beyond the Quartic Equation 


o(nu)/o""(u) can be expressed as follows where ga”) is the nth derivative of 


§a: 


gitu) gu) .. gpeDu) 
OW ae e"u) g™u) .. ew 
ot (u) (1!2!13!...(n-1)!)2 i 


eeDu) pMu) ... ge@r-3u) 
(4.2-28) 


The proof of this rather bizarre formula is beyond the scope of this book but can 
be found in the notes compiled by Schwarz.4 


4.3 Theta Functions 


Theta functions>-6 are periodic functions that can be represented by 
series whose convergence is extraordinarily rapid. They are important in 
providing the best means for the numerical computation of elliptic functions. 
For this reason theta functions are useful in the solution of algebraic equations 
requiring the use of elliptic functions. 

Consider a Weierstrass function £2 (zlw,’) and define t =w/w selecting 
@ and @' so that Im t> 0. Furthermore define g = e!@T =e/M@/ and y = 2/2M. 
Then the four theta functions can be defined as follows using the designations 
of Erdelyi et al: 


61(v) = 81(v,q) = 8; (vit) = 2 Yq Yi1ygnt Dsin[(2n+1) nv] (4.3-1a) 


n=0 


Qo(v) = 02(v,q) = Oa(via) = 2 Yq Yigt*Deos[(2n+1) nv] (4.3~-1b) 
n= 


5K. Chandrasekharan, Elliptic Functions, Springer, 
Berlin/Heidelberg, 1985, Chapter 5. 


6D. Mumford, Tata Lectures on Theta I, Birkhéuser, Boston, 1983. 


Elliptic Functions 73 


03(v) = 03(v,q) = 03(vit) =1+2 >.q"’cos(2nnv) (4.3-1c) 
n= 

Q4(v) = @4(v,g) = O4(viz) = 142 Y(-1)"%g"?cos(2nnv) (4.3-1d) 
n= 


These series converge for all (complex) v and all q defined as above. The factor 
qr leads to extremely rapid convergence. The function 0] is an odd function 
(8;(—v) = -91(v)) whereas the functions @2, 03, and @4 are even functions 
(8,(v) = 8, (-v) for n = 2, 3, 4). The trigonometric functions in the theta series 
can be converted into exponential functions leading to the following>: 


oo 


Qi) =i Yd ge a e2n-Dinv (4.3~2a) 


n=-—oo 


Oo(v) = dig) e(2n-1inv (4.3-2b) 


nN=-0o 


03(v) = >. ghe2ninv (4.3~2c) 


n=—0o 


Q4(v) = Y(-1)nq"?e2ninv (4.3-2d) 


N=-—-0o 


The four theta series (4.3-2) are Laurent expansions in the variable ei and 
converge for all finite non-zero values of this variable. 

All four theta functions are entire periodic functions of v with the period 
of @; and 62 being 2 and that of 63 and 04 being 1 (Table 4~1). Table 4-1 also 
shows their behavior under addition of half and quarter periods using the 
following abbreviations: 

A(v) = eR (4.3~3a) 

B(v) = ett "a (4.3-3b) 
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Table 4-1 shows that all four theta functions may be generated from any one of 
them by the addition of quarter-periods. From Table 4-1 0;(¥) is seen to have 
a zero at v = 0 and thus zeros at m + nt where m,n here, as elsewhere in Table 
4~1, are integers. It can be proved by integrating 01'/0) over the boundary of a 
parallelogram with vertices +1/2+1/27 that these are the only zeros of 61(¥). 


Table 4-1: Some Properties of the Four Theta Functions 


01(y) 82(v) 03(v) 84(v) 
ZeTOS m+nt mtYoent —o m+lot(ntly)t — m+(nt!/2)t 
y—-v —61(v) 82(v) 83(v) 84(v) 
vove+l ~O;(v) ~O2(v) 63(v) 64(v) 
V—-V+T ~A(v)8;(v) A(v)Oo2(v) A(v)03(v) ~A(v)04(v) 
vVv+1l+T A(v)81(v)_ —A(v)82(7) A(v)93(¥) —A(v)04(7) 
 povtl/y 82(v) -81(¥) 84(v) 03(v) 
v—v+l/ot iB(v)O4(v) ~—B(v)83(7) B(v)02(v) iB(v)81(y) 
vovelptl/ot  B(v)03(v) -iB(v)04(v) iB(v)01(y) B(v)62(v) 


Information on the zeros of the theta functions can be used to obtain 


infinite products representing the theta functions. Define qq as follows: 


qo= | [a-9?") 
n=1 


(4.3—4) 


Then the following infinite products for the theta functions are obtained: 


4 ive] 
@1(v) = 2goVq sin nv [][G-2¢2*cos2av + g4") 


n=1 


% oo 
82(v) = 2goVq cos nv [Ic 14+2q2"cos2nv + q4") 


n=] 


(4.3—5a) 


(4.3—Sb) 
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03(v) = go | [(4+2q2"-!cos 2nv + q4-2) (4.3~5c) 
n=] 

04(v) = go | [-2q2"-!cos 2nv + q4”-2) (4.3-5d) 
n=] 


These equations (4.3—5) are valid in the entire complex plane representing the 
complex variable v. 
The following relations between squares of theta functions of the same 


variable are of interest: 
612(v)822(0) = 042(v)032(0) — 032(v)042(0) (4.3-6a) 
@12(v)032(0) = 042(v)822(0) — 022(v)042(0) (4.3-6b) 
612(v)042(0) = 032(v)@22(0) — 22(v)032(0)  (4.3-6c) 
042(v)®42(0) = 032(v)032(0) — 622(v)822(0) (4.3-6d) 


Each of these equations may be proved by noting that the ratio of its two sides 
is a doubly periodic function with periods 1 and t without zeros or poles and is 
thus a constant. This constant can be evaluated using special values of v (half- 
periods). 

The “theta functions of zero argument” (i.e., v= 0 in equations 4.3—1 to 
4.3—-3) are particularly important and satisfy the following identities: 

01'(0) = 762(0)03(0)64(0) (4.3—T7a) 

674(0) + 644(0) = 634(0) (4.3—7b) 
Graphs illustrating the behavior of theta functions of argument zero are given 
elsewhere.’ 

Theta functions are very closely related to the Weierstrass sigma 
functions and can be used for their numerical evaluation because of their 
extremely rapid convergence.? The following equations then express the 
various Weierstrass functions in terms of theta functions using the notation in 
equations 4.2—11 to 4.2~14 where v =2/2o: 


2 
o(z) =2@ ent ro 10,0 — (4.3-8a) 
1 


TF. C. Tricomi, Funzioni Ellittiche, Zanichelli, Bologna, 1951; German 
edition, Akad. Verlagsgesellschaft, Leipzig, 1948. 


76 Beyond the Quartic Equation 


_mz, 01') ? 
C(z) = ase iO (4.3-8b) 
1 | 81'(0)8g41(Y) 2 
= reer h = ae 
§0(z) = @g+ | et where a = 1, 2,3 (4.3-8c) 
62(v)83(v)84(v)01'3(0) 
e'(2) = - Se 4.3-8d 
4@382(0)03(0)04(0)013(v) 
122e, = n2[834(0) + 644(0)] (4.3-9a) 
12@2e = n2[04(0) — 644(0)) (4.3-9b) 
122e3 = —n2[024(0) + 034(0)) (4.3-9c) 
Jez ~e3 =i Vez ~e2 = 582) (4.3-10a) 
Je; —e3 =i Ve,—e] = 583) (4.3-10b) 
Je; ~e2 =i Vez—e] = 5840) (4.3-10c) 
e=% ey) [08(0) + 038(0) + 048(0)) (43-112) 


83 -a(x) [824(0) + 634(0)][634(0) + @44(0)}[844(0) — 624(0)] (4.3-11b) 


4 
=—_ 9;'2(0) = Ls) 2 4,3-12 
A= to (0) E [02(0)3(0)84(0) (4.3-12) 
61"'O) it 70,;"'O) } 
rend eee oral ee 4, —13 
1248 1'(0) H 24a?6 1'(0) ran 


Equation 4.3—8a may be proved by noting that the quotient of the function on its 
two sides is a doubly periodic function without poles or zeros and approaches 1 
as v and z approach 0. Equation 4.3-8b can be obtained by logarithmic 
differentiation of (4.3—8a). 
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The Jacobian elliptic functions can also be expressed in terms of theta 


functions leading to the following equations using the notation in (4.1-9) to 
(4.1-11): 


02(0) 
Vk = 0510) (4.3—14a) 
7 = 240) (4.3—14b) 
63(0) 
_ 8300)01) ; (4.3-15a) 
62(0)04(y) 
= 84(0)02(v) (4.3—15b) 
02(0)04(v) 
= 24(0)03(V) (4.3-15¢) 
03(0)84(v) 


Thus given t= @'/@, (4.3-14a) determines the modulus k of the Jacobian 
elliptic functions and equations 4.3—15 determine the functions themselves. 


Many applications of elliptic functions require calculation of a gq when 
ig| < 1 from a given k2 defined by 


42 2240.9) _, _ 9440.9) _ 


= 1 — (k')2 4.3-16). 
634(0,q) 834(0,q) e : 


This is called the inversion problem. Frequently 0 < k? < 1 and 
044(0.q)_ | Fd - 92"! 
4,3~17) 
034(0,9) 4 a 3. Ge ge) 
by equations 4.3—5. As qg increases from 0 through real values to 1, the infinite 


product decreases monotonically from 1 to 0 so that (4.3-16) has exactly one 
solution g in the range 0 <q <1. 
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The inversion problem thus involves the calculation of a value of the 
theta function parameter g corresponding to a set of roots é1, €2, and e3 of the 
cubic equation 4.2~-17 = 4.2-18. The first step involves calculation of the 
parameter L defined by the following equation: 


1 — Vk' Vey = e3 Vel 8) 
= ie (4.3-18). 
1+vki 4 


4 
Ve, — e3 + Vei —e2 


The corresponding value of qg can then be obtained by inversion of the 


equation’ 
L +g? + q?> + ... 
Bd 43-1 
2° 14+2¢q4 + 2q16 + ... nee 


to give the so-called Jacobi nome 


13 


5 9 4j+1 
q= (=)+ 2(7) +12 (5) +150 (=) ae > qj (=) (4.3-20) 
in which the coefficients q; form the series’ 


1; 2; 15; 150; 1707; 20,910; 268,616; 3,567,400; 48,555,069; 
673,458,874;  9,481,557,398;  135,119,529,972; 1,944,997,539,623; 
28,235,172,753,886 

Equation 4.3—18 poses some difficulty since for a given ordering of the 
roots €1, €2, €3 of the cubic equation 4.2-17 there are four fourth roots 
Ne, es and four fourth roots Ve] — e2. The 16 possible combinations of 
these two fourth roots reduce to four possible values of L when the quotient 
(4.3-18) is taken. In addition, there are six permutations of the three roots e1, 
e2, and e3 leading to (4)(6) = 24 possible g values for a given cubic equation 
4.2-17. In general, half of these possible q values will have !ql > 1 and 


8A. N. Lowan, G. Blanch, and W. Horenstein, On the Inversion of the 
q-Series Associated with Jacobian Elliptic Functions, Bull. Amer. 
Math. Soc., 48, 737-738 (1942). 
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therefore are unsatisfactory since the series for the corresponding theta 
functions do not converge. Early work by Weierstrass‘ suggests the following 
criteria for choosing the correct ordering of e1, e2, e3 and the correct fourth 
roots in equation 4.3—18: 

(1) The labelling e1, e2, e3 of the roots of equation 4.2—17 is chosen so that 
neither the expression 4.2-21c for k2 nor 4.2-21d for k’2 has a negative real 
value and also neither of the two expressions has a positive real value which is 
equal to or greater than one; 

(2) The initial square roots required to convert the expressions 4.2—21c for 
k and 4.2-21d for k'2 to k and k’, respectively, are taken so that the real parts 
are positive leading to a positive real part for the quotient f : 


(3) The fourth roots are taken so that the quotients 


4 4 

aes 22 a arn Sars (4.3—21a) 
Vel — €3 

| Ve -—e ‘Tepe 

VE =2A=£2 = [E182 (4.3-21b) 
Ve} —€3 


have the principal values. 
4.4 Higher Order Theta Functions 


The definition of elliptic integrals by an equation of the type 


dx 


(4.4-1 
V P(x) : 


f@) = 


where (x) is a cubic or quartic polynomial can be extended to hyperelliptic 
integrals where P(x) is a polynomial of degree 5 or higher. The corresponding 
higher order theta functions are required for solutions of general algebraic 
equations of degree 6 or higher. 
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The simple theta functions discussed in the previous section may be 
expressed as singly infinite series of exponentials of polynomials which are 
quadratic in the summation index, e.g., from (4.3-2c), 


03(v) = digr?e2ninv (4.4-2) 
hn=—oco 
Since g = e'™ such functions can be expressed generally in the form 


Q(v) = Yexp(2ninv + ixtn?) (4.4-3) 


h=-co 


where T is a complex constant whose imaginary part is positive to ensure 
convergence.? There are three other functions functions gg connected with 
this one and obtained from by replacing n by n + !/20 and v by v + 1/28 where 
o and § may be 0 or 1. The four single theta functions are given the labels 
61(v), 82(v), 83(v), and @4(v) in the previous section (e.g., equations 4.3—2). 
Various ratios of these single theta functions are elliptic functions (e.g., 
equations 4.3-8). 

The concept of theta functions can be generalized to a multiply infinite 
series with several arguments in which Tt is now a symmetric square matrix 
rather than a single complex number. The genus of such a theta function is the 
size of its t matrix. The genus 2 or double theta functions are important in the 
solution of the general sextic equation and can be written as follows: 

Bop(v) = LLexp[2ni(n+!/20)(v+!/oB) + nit(nt}/pa)2] (4.4-4) 
in which Tt is a 2 x 2 symmetric matrix, i.e., 


t= Gee and ty = 11 (4.4-5) 
721722 


and all other letters are row designations standing for pairs of letters 


OR. W. H. T. Hudson, Kummer’s Quartic Surface, Cambridge, 1905, 
Chapter XVI. 
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distinguished by suffixes 1 and 2. The summation is taken over all integer 
values of ny and n2 ranging from -co to +oe, The variables a; and a2 are 
integers which are taken to be 0 or 1 since the integer parts of !/2a may be 
absorbed in n, By and B2 are also integers, and since the addition of even 
integers to B can at most change the sign of the function, it will generally be 
supposed that B; and B2 are either 0 or 1. The matrix ts 

Bi Bo 
characteristic of the double theta function and can be indicated by the suffix of. 
The designation aB denotes «16, + o2B2, and the parity of the theta function 


jis called the 


depends upon the parity of this expression since by taking a new pair of 
summation letters n'=-—n —a, the order of the terms is changed without 
altering the value of the function. 

There are 22? =16 different characteristics for the double theta functions 
since each of the four elements a}, 2, 81, and B2 can be 0 or 1. The double 
theta function in which all of the elements are 0 is called the zero characteristic. 
Corresponding to these double theta functions of zero characteristic are 16 half 
periods 1/2(ta + B). The effect of adding a half period to the argument by a 
theta function is to change the characteristic and multiply by a nonvanishing 
function. Of the 16 characteristics six are odd and ten are even. Hence six 
double theta functions are odd functions and vanish for v = 0. 


Chapter 5 
Algebraic Equations Soluble by Radicals 
5.1 The Quadratic and Cubic Equations 


The general monic quadratic equation can be written as 

x2 +a\x+a2=0 (5.1-1). 
The Galois group of this equation, $2 ~ C2, has only the two elements 
{1, (12)} and the corresponding alternating group A2 ~ C1 contains only the 
identity element. The trivial normal series can be described as Go = Cy = Aj 
<1 C2 = Sz with a factor group S2/A2 = C2. The two roots of the equation are 
obtained by the following well-known quadratic formula: 


—~ayt “J oe 
XE 2 HEN Ge k= 1,2 (5.1-2) 


The C2 factor group corresponds to the square root in equation 5.1-2. The 
nonidentity permutation of the Sz = C2 Galois group of this equation, namely 
(12), is seen to permute the two roots by changing the sign of the radical. The 
quantity under the radical in (5.1-2), namely a 2 — 4a2, is called the 
discriminant, D, of the quadratic equation. If D = 0, the two roots of the 
quadratic equation are equal, namely x) = x2. If D > 0, the quadratic equation 
5.1—-1 has two distinct real roots whereas if D < 0 the quadratic equation has 
two distinct nonreal complex roots which necessarily are complex conjugates. 

The solution of the general monic cubic equation! 

x3 + ayx2 + anx + a3=0 (5.1-3) 
is much more interesting. The Galois group of this equation, $3, with 3! = 6 
elements, is isomorphic with the symmetry group of the equilateral triangle, 
namely D3,. The S3 group has the following normal series and associated 
factor groups: 


1E. Dehn, Algebraic Equations, Columbia University Press, New York, 
1930. 
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Go=C1 9 C3 =A3 9583 | (5.1-4) 
A3/C, =C3 (5.1-Sa) 
S3/A3 = C2 (5.1-5b) 


The solution of such an equation based on this normal series is thus seen to 
involve a square root from the factor group 53/A3 = C2 inside the cube root 
from the factor group A3/C, = C3. 

In the actual algorithm to solve such a cubic equation (5.1-3)2 the 
equation is first simplified by applying a Tschirnhausen transformation (Section 
3.3) to the general cubic equation (5.13) to give the following reduced cubic 
equation with no term of degree 2: 


z3 + boz+ b3 =0 (5.1-6) 
where . 
x=z- 2 (5.1-7a) 
_ai2 
bc $22 = a (5.1—7b) 


a 
b3 = 241° = Pag + 2743 (5.1~7c). 


The remainder of the algorithm consists of methods for determination of the 
roots of the reduced cubic equation (5.16). In this connection we seek to 
determine two numbers u and v in such a manner that 

zZ=Ut+yV (5.1-8) 
will satisfy equation (5.1-6). Substitution u + v for z in equation (5.1-6) gives 
the following equation: 

u> + v3 + 3(uv + bo/3)(u + v) + b3 =0 (5.1-9) 
Equation (5.1-9) will be satisfied if u and v are chosen in accord with the 
following relationships: 

u3 +3 = b3 (5.1—10a) 

uy ==92 | (5.1-10b) 
Solving equations (5.1-10) simultaneously gives 

uo + b3u3 — be =0 (5.1-11). 


Solving equation 5.1~11 as a quadratic equation in u3 gives 


2N. B. Conkwright, Introduction to the Theory of Equations, Ginn, 
Boston, 1941, Chapter 5. 


84 | . Beyond the Quartic Equation 


4b23 
—bat 2 ee So 
tN or + Pag [oP 2 


aye ae (5.112) 
If uj denotes any of the cube roots of the number on the right hand side of 
equation (5.1—2), the three cube roots are uj, Mu, and w2u; where ® = 
exp(27i/3) (i.e., a complex cube root of yee The corresponding values of v, 


obtained from equation (5.1—10b), are — on rae and ae or 1-3 
Ou] uJ 
wb wb2 


a and — 3u, , Tespectively, since w3 = 1. Now if -g is denoted by 
v1, then the three roots of the reduced cubic equation (5.1-6) may be written in 


the following form: 


Zj =U, +Vq (5.1-—13a) 
Z2 = MU + O V4 (5.1-13b) 
73 = @2u, + Ov] (5.1-13c) 


Combining equations (5.1-12) and (5.1-13) leads to the following relations, 
called Cardan’s formulas, for the roots zg (k = 1, 2, 3) of the reduced cubic 
equation (5.1~-6): 


3 er 


—b b32  b23 —b b32 _ b23 
ae Re Ee VE VE Gro 


In (5.1-14) the cube roots for the two terms are selected so that their product is 
always —b2/3 in accord with equation (5.1-10b). This solution of the general 
cubic equation (5.1—3) thus relates to the factorization of its S3 Galois group 
with the C3 factor (5.1—5a) corresponding to the outer cube roots in equation 
5.1-14 and the C> factor (5.1—5b) corresponding to the inner square roots in 
(5.1-14) arising from solution of the quadratic equation (5.1—12) in u3, 

The cubic equation, like the quadratic equation, has a discriminant that 
determines the nature of its roots. The discriminant of the general cubic 
equation 5.1~3 is essentially the quantity under the square root signs in equation 
5.1-14, which is conveniently written as . 

A = 4b73 + 27b32 (5.1-—15) 
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The following three situations cover all possibilities for the three roots of a 
general cubic equation (5.1~3): 
(1) A=0 = there is a multiple root; 
(2) A>0 => there is exactly one real root and two nonreal complex conjugate 
roots; 
(3) A <0 = the three roots are real and distinct. 
The case of three real and distinct roots is historically called the casus 
irreducibilis (irreducible case),3 since evaluation of the roots of such equations 
using Cardan’s formulas (equation 5.1—14) requires taking real cube roots of 
nonreal numbers. Thus, although the roots of such cubic equations can be 
expressed using only radicals in addition to simple arithmetical operations 
(addition, subtraction, multiplication, and division), evaluation of these radicals 
in these expressions is not feasible. This difficulty is circumvented by using 
trigonometric functions rather than radicals to solve such cubic equations with 
three real roots. | 

The radicals in Cardan’s formulas (equation 5.1-14) may be evaluated 
using trigonometric functions by first using the relationships: 


ocean aN ee 


(5.1—16a) 
: 2 3 
= / oe ‘ 22 ne (5.1-16b) 


The absolute value of the complex number on the right hand side of equation 


—h3 
(5.1-16a) is Nes and the amplitude may be found by the following 
relation: 


~b3 


—b23 
27 


cos d= (5.1-17) 


2 


3D. E. Dobbs and R. Hanks, A Modern Course on the Theory of 
Equations, Polygonal Publishing House, Passaic, NJ, 1980, Chapter 3. 


86 | Beyond the Quartic Equation 


The angle ¢ terminates in the first quadrant (i.c., 0° < ¢ < 90°) if b3 is negative 
(ie., cos @ > 0) and in the second quadrant (i.e., 90° < @ < 180°) if b3 is 
positive (i.e., cos ¢ <0). This leads to the following expressions for the 
radicals in Cardan’s formulas in terms of trigonometric functions: 


ve 3 3 
PfEFFeweriiny avis 
2 3 _ho3 
fe + va = (os o—isin 9) (5.1-18b) 


Taking the cube roots of these equations gives the following where k = 1, 2, 3: 


3 
a pal bak. = 
3,4] 23- + be a\ = =b2 (cas = $t2ke i sin etcke ) (5.1-19a) 


2 3 - 
7x 22 32 (cos eee isin ereke ) (5.1-19b) 


These equations lead to the following formulas for the roots of the reduced 
cubic equation (5.1-6) in terms of trigonometric functions: 


i Eee | (5.1-20a) 
¢ = cos! ; -by3 | 
2] 


ES NE co 2 ™ bs 7) pot os (5.1-20b) 


In equations 5.1—20 the C3 factor group corresponds to the division of @ + 2kn 
by 3 before taking the cosine (5.1-20b) whereas the C 2 factor group 
corresponds to the square roots in the formulas. Note that the sequence of steps 
5.1-20a and 5.1-20b first involves taking the inverse cosine of one number 
followed by taking the cosine of another number derived from the first inverse 
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cosine after dividing by 3. This procedure is analogous to evaluating a cube 
root by taking its logarithm, dividing the logarithm by 3, and then taking the 
inverse logarithm of the quotient. Both the logarithms needed in general to 
evaluate radicals (e.g., the square and cube roots in equation 5.1-14) and the 
inverse cosine needed for equation 5.1~20a can be expressed as integrals of the 
type (compare equation 4.1—3 in Section 4.1) | 


d 
fy) = (= (5.1-21) 
[aor 


in which Ay) is a quadratic polynomial, i.e., y2 for In a and 1-y for -cos~la. 
This makes readily apparent the close relationship between the two methods for 
solving cubic equations, namely that represented by equation 5.1~14 and that 
represented by equations 5.1-20. 


5.2 The Quartic Equation 


Increasing the degree of the algebraic equation from 3 to 4 introduces no 
fundamentally new features. The Galois group of the general quartic, $4, with 
4! = 24 elements, is isomorphic to the symmetry point group of the regular 
tetrahedron Tg (Section 2.3). The S4 group has the following normal series 
and associated factor groups: 


Go =C1<1C2 D2 Ag 84 =Ty (5.2-1) 

CofC, = C2 (5.2-2a) 
D/C2 = C2 (5.2~2b) 
A4/D2 = C3 (5.2-2c) 


cubic equation (see section 5.1) 


Sa/A4 = C2 ~ (5.2-2d) 


Thus the solution of a quartic equation can be reduced to the solution of a cubic 
equation (called the resolvent cubic equation) corresponding to the C3 and C2 
factor groups in equations 5.2—2c and 5.2—2d, respectively, followed by 
solutions of two additional quadratic equations corresponding to equations 
5.2—2a and 5.2-2b. 

A classical method for solving the quartic equation which has these 
features is Ferrari’s solution of the general quartic equation.2 Consider the 
following general monic quartic equation: _ 


88 ; Beyond the Quartic Equation 


x4 + ayx3 + anx2 + agx+a4q=0 (5.2~3) 
Determine the numbers a, b, and k such that 
2 
A+ ay + ap? + ape + ag + (ax + bY (x? + Fe + k) (5.24) 


by equating the coefficients of like powers of x on both sides of this equation 
leading to the following relations: 


2 
From the x2 term: az +a =2k+ + (5.2~5Sa) 
From the x term: 2 ab + a3 = kay (5.2—Sb) 
From the constant term: b2 + aq = k2 (5.2—5c) 


Solving the three simultaneous equations for a, b, and c leads to the following 
cubic equation for k, known as the resolvent cubic: 


2 
IBF K+ Haray — dag)k +5 (4anag ~ay2ag—a32)=0  (5.2-6) 


Solving this cubic equation (5.2-6) for k using either (5.1—14) (the radical 
method) or equations 5.1—-20 (the trigonometric method) followed by substitu- 
tion in (5.2—5a) and (5.2—5b) gives a and b. Combining equations 5.2—3 and 
5.2—4 gives the following equation in which both sides are perfect squares: 


(x24 Fx +k)? = (ax +b)? (5.2-7) 
Taking one square root gives the following two quadratic equations: 

4+ Fxtk=ax+b (5.2-8a) 

+ x +k=—ax-b (5.2-8b) 


The four roots of the original quartic equation 5.2—3 are then obtained pairwise 
from solutions of the two quadratic equations 5.2-8. 

The Ferrari method for solving the general quartic equation 5.2-3 
corresponds to the factor group structure 5.2—1 and 5.2—2 for the symmetric 
group $4 ~ Ty as follows: | 
(1) A4/D2 = C3 and S4/Aq4 = C2 correspond to the resolvent cubic equation 
5.2-6; 
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(2) D2/C2 = C2 corresponds to taking the Square roots of both sides of equation 
5.2~7; 


(3) CIC 1 = C2 corresponds to solving the quadratic equations 5.2—8. 
5.3 Special Quintic Equations Solvable by Radicals 


Consider an irreducible monic quintic equation, i.e. 

x? + ayx4 + agx3 + a3x2 + agx + a5 =0 (5.3~1). 
Its Galois group must be a transitive permutation group for the five roots x, (1 
<k <5). Table 2-3 indicates that there are five different transitive permutation 
groups of degree 5, namely C5, D5, M5, As =/, and Ss with 5, 10, 20, 60, and 
120 operations, respectively. Among these five groups the first three, namely 
Cs, Ds, and Ms, are soluble, whereas the remaining two groups, namely A5 
and Ss are not soluble. Thus irreducible quintic equations with C's, Ds, and Ms 
Galois groups are soluble by radicals whereas the more general irreducible 
quintic equations with As and Ss are not soluble by radicals. Berwick4 has 
characterized quintic equations with these Galois groups and thus soluble by 
radicals. 

The metacyclic group Ms contains the 20 permutations of the general 
type s/2" (m = 1,2, 3,4,5;n =1, 2, 3, 4) where s = (12345) andt = (1234) 
so that s° = ¢4 = 1 and st = ts? (or 1st = 52). The dihedral group Ds is a 
normal subgroup of M5 of index 2 containing the 10 permutations of the general 
type su" (m = 1, 2, 3, 4, 5; n= 1, 2) where u = 2 = (12)(34) and s5 = u2 = 1 
and su = us4 (or u-1su = s4). The cyclic group Cs consists of the single cycle 
s, s*, 53, s4,s5= 1. The cyclic group C5 consists of the fivefold proper 
rotations of the regular pentagon, i.e., C5” (n = 1, 2, 3, 4) and C55 =E 
whereas the dihedral group Ds consists of not only the fivefold proper rotations 
of the pentagon within its plane but also its twofold proper rotations (“flips”) 
out of its plane (Figure 5-1). The metacyclic group Ms contains not only all of 


_ the proper rotations of the regular pentagon but the proper rotations that can 


convert a regular pentagon into the pentagram having the same vertices as the 


4w. E. H. Berwick, The Condition that a Quintic Equation Should be 
Soluble by Radicals, Proc. London Math. Soc., (2) 14, 301 (1915). 
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original pentagon (Figure S—1). Furthermore the group Ms has the following 
normal series and associated factor groups: 


Go =C11C54I D5 Ms (5.3-2) 

C5/Cy = C5 (5.3—3a) 
Ds/C5 = C2 (5.3—3b) 
Ms/Ds5 = C2 (5.3~3c) 


4 4 
oe 7S oI, 


\_/ Ve 


1 
C,; and Ds: M;: pentagram 
pentagon | inside a pentagon 


Figure 5-1: The regular pentagon and the pentagram inside a pentagon and their 
relationships to the soluble transitive permutation groups of degree 5, Cs, D5, and Ms. The 
numbers on the vertices correspond to the indices k on the roots x, of the corresponding 
quintic. 


Consider the problem of solving a monic quintic equation 5.3—1 by 
radicals in a field of rationality K which contains the coefficients a, (1 <n <5). 
For the existence of such a radical solution of the quintic it is necessary and 
sufficient that some rational function of the roots xz (1 < k $< 5), which is 
unaltered by the 20 permutations of k in M5 but no others, should have a value 
rational in the field K. 

The quintic 5.3—1 can be reduced to the Bring-Jerrard form 

y? —Agy + As =0 | (5.3~4) 
by a Tschirnhausen transformation requiring solution of a cubic equation as 
well as a quadratic equation (Section 3.3). In general the coefficients Aq and A5 
of equation 5.3-4 belong to a larger field L where the extension L:K (Section 
3.1) is of degree 6 because of the nature of this Tschirnhausen transformation. 
The coefficients Aq and As are rational in the original field K only for a certain 
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limited class of quintics. Runge defines a function y of the five roots x; of the 
quintic: 
v= "a(xpx2 + x2x3 + 13x4 + x4X5 + x51 — X1K3 — x3X5 — 5x2 ~ x2x4 - x4x1)2 
(5.3~5) 

in which the products x;xz corresponding to the edges of the pentagon and 
pentagram in Figure 5-1 have positive and negative signs, respectively. He 
then shows that y is a root of the following sextic equation: 

YW) = (y— Aa)t((y2 — 6yAg + 25442) — S9Asty =0 (5.3-6) 
If (5.3-6) has a rational root in the field L, the quintic can be solved by radicals 
in L and consequently in K. Thus the problem of identifying quintic equations 
soluble by radicals consists of obtaining a sextic equation ©(@) = 0 satisfied by 
a function of symmetry M5 but with rational coefficients in K. Since two 
rational functions of the roots which belong to the same group are rational 
functions of each other, the sextic equation is connected with equation 5.3-4 by 
a transformation of one of the following types where y, and z, are rational in L: 

G=Yo+ yi + yoy? + 39 + yay + yoy (5.3~7a) 

W= 7 + 219 + 2262 + 2393 + 2464 + 2505 (5.3-7b) 
The discovery of roots of the sextic equation @(¢) = 0 rational in K is a simpler 
process that that of roots of ‘P(y) = 0 rational in L. Thus if the coefficients of 
the original quintic equation 5.3-1, a, (1 <n < 5), are ordinary rational 
numbers, the extended field L is in general a soluble sextic field, and the testing 
of a sextic equation whose coefficients belong to a sextic field for roots rational 
in that field, although possible in a finite number of steps, is a somewhat 
lengthy operation. 

The following function of the roots of the quintic equation 5.3—1 can be 
shown to be unaltered by the permutations of the metacyclic group Ms: 

@ = (1 ~ 2)? (2 — 3)2(x3 — x4)2(g — 25)2(5 - 21)? 

+ (¢1 —x3)?(x3 — x5)2(a5 — x2)?(x2 — x4)? (x4 — x1)? (5.3-8). 

In equation 5.3—8 the first and second terms correspond to the edges of the 


. pentagon and pentagram, respectively, in Figure 5-1. The number @ is one of a 


5C. Runge, Uber die auflésbaren Gleichungen von der form x5 + ux + 
v =0, Acta Math.,7, 173-186 (1886). 
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set of six Conjugates which may be obtained by the following six permutations 
of the indices k of the roots xg: re: 

1, (123), (132), (23), G1), (12) (5.3-9) 
These permutations on the first three indices 1,2,3 and leaving the indices 4 and 
> unchanged form the dihedral group D3. The group of each of the conjugate 
functions is a subgroup (although, of course, not a normal subgroup) of the 
symmetric group S5 conjugate to Ms. A symmetric function of the six @’s is an 
invariant® of the binary homogeneous quintic form 

f(x,y) = x5 + apxty + anx3y2 + a4x2y3 + agxy4 + asy5 (5.3-10) 
and the six elementary symmetric functions are rational integral invariants. 
Thus the six @’s can be assumed to be roots of a sextic equation | 
D(g) = G6 + Agg + Aggs + A123 + A16¢2 + A209 + Arg = 0 (5.3-11) 
in which Aq, Ag, A12, A16, A209, and A24 are rational integral invariants of 
degrees 4, 8, 12, 16, 20, and 24, respectively. 

The methods of invariant theory (Section 2.6)® can be used to evaluate 
the coefficients A4, Ag, A12, A16, A20, and A24 of equation 5.3-11. The 
quintic form f(x,y) in equation 5.3~10 has the following four irreducible 
invariants of which the first three are needed: 


degree 4:J=D =(i,i)? (5.3—12a) 
degree 8: K = (i3,H)6 (5.3-12b) 
degree 12: L = (i5,2)10 (5.3-12c) 
degree 18: / = (i7,fT)14 (5.3-12d) 


In equations 5.3~12 i is the fourth transvectant (f,)4 and thus of degree 2 and 
order 2, and D is the discriminant of i, which is obtained by the Hessian (i,i)2. 
Furthermore, H is the Hessian of f, namely (ff), and thus of degree 6 and 
order 2, and T is the functional determinant of f and H, namely ((,H) and thus 
of degree 9 and order 3. The value of i (also called S) can be obtained as 
follows: 

i="/yo0{(20a4 — 8a1a3 + 3a22)x2 + (100a5 — 12a,a4 + 2a2a3)xy 

+ (20a,a5 — 80204 + 3a32)y2] 
= Ax2 + Bry + Cy2 (5.3-13) 


60. E. Glenn, A Treatise on the Theory of Invariants, Ginn and Co., 
Boston, 1915, pp. 148-150. 
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Then J = D is the discriminant of the quadratic form 5.3~13, namely 
J = B2—4AC (5.3-14) 
The functional determinant T can be obtained from the expression 


10x+2a1y 2ayxtany anxtary 
T = aa 2ajxt+a2y a2xta3y = a3x+2aay 
agx+a3y azxt+2agy 2a4x+10asy 


= Dx} + Ex2y + Fxy2 + Gy3 (5.3-15) 


Then K and L are given by the following equations in which A, B, C, D, E, F, 
and G are defined as in equations 5.3-14 and 5.3-15: 

K = 2A@3EG — F2) — B(9DG — E F) + 2C(3FD — E2) (5.3-16a) 

L=1/3[43EG — F2)(3FD — E2) ~ (9DG ~ EF)? (5.3-16b) 
The rational equation 5.3-11 for @ can now be written as follows by combining 
J,K, and L (equations 5.3-14, 5.3-16a, and 5.3-16b) in all possible ways to 
get invariants of the required degrees 4, 8, 12, 16, 20, and 24 for the 
coefficients A4, Ag, A12, A16, A20, and A24, respectively: 


Dg) = G9 + DP + (baJ2 + b3K) G4 + (bgJ3 + bsIK + bol) P 
+ (b7J4 + bgJ2K + boK2 + big/L) ¢2 
+ (byJ9 + 123K + by3JK2 + byg/2L + b15KL)o 
+ by6J® + by7J4K +b1gJ2K2 + b19K3 + boQJ3L + bo JKL + 
bool = 0 | (5.3-17) 


The numerical coefficients bj, b2, ..., b22 can be found by taking a sufficient 
number of special quintics, whether irreducible or not, and comparing the actual 
coefficients of (5.3-17) with those involving the b’s. Berwick4 used the 
quintics x° —n = 0, x5 —x = 0, and quintics with the roots {1,-1,3,-3,0}, 
{1,1,-1,-1,0},  {1,2,-2,0,0}, {1,3,-3,0,0}, {1,-1,9,p2,0}, and 
{1,-1,2,-2,0} to evaluate these b’s where pis given by the equation p2 + p + 
1 =0 => p =(-1+ J-3). The numerical coefficients in the final results can be 
simplified by defining the following parameters: 
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j =S (5.3~18a) 
k = 2556K (5.3-18b) 
[ = ~210597, (5.3-18c) 


Then equation 5.3—17 becomes 

D(g) = $6 + 10jG5 + (35j2+10K) G4 + (60/3 + 30jk + 103 

+ (55j4 + 30j2k + 25k2 + 50jNg2 

+ (265 + 10j3k + 44jk2 + 59721 + 14kDo 

+ 5j6 + 20j2k2 + 20/31 + 20jkl + 257 =0 (5.3-19) 
Now assume that the quintic equation 5.3—1 is irreducible in the field K. When 
the sextic 5.3—19 has no rational root, the Galois group of the quintic is A5 or 
Ss according to whether the discriminant A = 55(J2 — 27K) is a rational square 
or not. The quintic 5.3—1 is soluble by radicals in K if and only if @(@) = 0 
(equation 5.3—19) has a rational root so that the Galois group of the quintic is 
Ms, Ds, or Cs (Table 2-3). 

In the case where equation 5.3—19 has a rational root 9, the following 
procedure discussed by Berwick‘ can be used to determine whether the Galois 
group of the quintic is Ms, Ds, or Cs. Thus consider the function 

X= (x1 — x2)2(x2 — x3)2 (3 — x4)? (x4 — X5)2(05 — x1)? (5.3-20) 
the factors of which correspond to the edges of the regular pentagon in Figure 
5-1. The group of this function is the dihedral group D5 whereas the group of 
the function 

Vor = (1 — x2) (02 ~ x3)(43 — x4) (04 45) — 1) (5.3-21) 
is the cyclic group C5. Now let @ be a rational root of the sextic 5.3-19 and 7 a 
root of the following quadratic, which is rational because 9 is rational: 

a — 64 + S12 ~ 27K) =0 (5.3-22). 
When this quadratic (or each of them if there are two) has no rational root, the 
Galois group of the corresponding quintic is the metacyclic group M5. 
However, when the quadratic 5.3—22 has a rational root v, the Galois group of 
the corresponding quintic is the dihedral group Ds unless one of the roots of the 
quadratic is a rational square in which case the Galois group of the quintic is the 
cyclic group C5. 


Chapter 6 


The Kiepert Algorithm for Roots of the 
General Quintic Equation 


6.1 Introduction 


The previous section (Section 5.3) discusses the problem of solving 
special quintic equations that are soluble by radicals, namely irreducible quintic 
equations with the soluble Galois groups C5, D5, and Ms. This chapter 
discusses the solution of general quintic equations including quintic equations 
having the Galois groups A5 and S5, which are not soluble by radicals. 

The least complicated candidate for a quintic equation not solvable by 
radicals is the one-parameter equation 

~O+ax+a=0 (6.1~1). 
The roots of (6.1—1) can be expressed! as the Weierstrass elliptic function 
Q(z; | 0,a), 1 <j < 5, (Section 4.2), in which 

[(0'@| ga,93)I? = 4 (z | g2,23)13 — 92.921 92,83)-83 (6-1-2) 
and z; (1 $j <5) are the five solutions of 

ga(z10,a) 0'(z|0,a) + iva go(z|0,a) + 2iVa =0 (6.1-3) 
in the period parallelogram of the elliptic function. 

These apparently elementary ideas are far from sufficient for a workable 
algorithm for solution of the general quintic equation 

x + Ax4 + Bx3 + Cx2+Dx+E=0 (6.1—4) 
because of the following difficulties: 

(1) Equation 6.1—1 is obtained by the trivial Tschirnhausen transformation 
(Section 3.3) ¢ = (8/a)x from the Bring-Jerrard normal form of the quintic 


equation, namely 


E+ a& + B=0 (6.1-5) 
5 
so that a nae #0 (6.1-6). 


1A. Hausner, The Bring-Jerrard Equation and Weierstrass Elliptic 
Functions, Amer. Math. Monthly, 69, 193-196 (1962). 
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However, transformation of the original general quintic equation (6.1-4) to the 
Bring-Jerrard normal form (6.1—5) requires an additional Tschirnhausen trans- 
ormation using two square roots and one root of a cubic equation (i.e., a total of 
three square roots and one cube root).* This Tschirnhausen transformation is 
very complicated.2 

(2) Equation 6.1—3 must be solved in order to obtain the five values of z; 
<j $5) in the period parallelogram in order to evaluate go (z; |0,a). 

The difficult Tschirnhausen transformation from the general quintic 
(6.14) to its Bring-Jerrard normal form (6.1-5)2 is avoided in the method 
described by Kiepert in 1878 for solution of the general quintic equation.3 The 
author in collaboration with Prof. E. R. Canfield has verified the Kiepert 
algorithm by programming it onto a microcomputer and testing the program by 
determining the roots of some simple irreducible general quintic equations.4> 

The general structure of the Kiepert algorithm is depicted in Figure 6-1. 
The Kiepert algorithm may be broken down into the following seven steps: 

(1) The Tschirnhausen transformation of the general quintic into the 
principal quintic | 

2 + Saz2 +5bz+c=0 (6.1-7). 
This Tschirnhausen transformation requires only a single square root in contrast 
to the Tschimhausen transformation of the general quintic (6.1-4) to its Bring- 
Jerrard normal form (6.1-5) which requires three square roots and one cube 
root. 


2A. Cayley, On Tschirnhausen’s Transformation, Phil. Trans. Roy. 
Soc. London, 151, 561-578 (1861); Cayley’s collected works, paper 
275. 

3L. Kiepert, Auflésung der Gleichungen fiinften Grades, J. far Math., 
87, 114-133 (1878). 

4R. B. King and E. R. Canfield, An Algorithm for Calculating the Roots 
of a General Quintic Equation from its Coefficients, J. Math. Phys., 32, 
823-825 (1991). 

5R. B. King and E. R. Canfield, Icosahedral Souiiieity and the Quintic 
Equation, Comput. and Math. with Appl.,24, 13-28 (1992). 
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General Quintic Equation 


x>4 Axt + Bx 4+ Cx? + Dx+ E=0 


cp Et tk = v3 + A? + Bu + C) i+ (zy -— v)2(2u + A) 


u4+Aus+Bu2%+Cu+D + (zz - v)(3u242Au+B) + (zR - v)* 
t 
a 


Principal Quintic Equation 
z>45az*+5bz+c=0 


ec 
K“(2/Z) - 3 


Brioschi Quintic Equation 
y>°-10Z/ +45Z*y—Z7=0 


hae ent ee ae RE 
Yk = ys (° — Sk)(Sk+2 — Sk+3)(Sk+4 ~ Sk+1) 


Jacobi Sextic Equation 


6 ——so — Sree oats 
S? + A (2 St ’ = 0 


Weierstrass elliptic functions 
Theta functions 
Period evaluation 


Figure 6-1: The general outline of the Kiepert algorithm for solution of the general quintic 
equation. 
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(2) A second Tschirnhausen transformation of the principal quintic (6.1—7) 
into the Brioschi quintic 

yp — 10Zy3 + 45Z2y — Z2 = 0 (6.1-8) 
in which the coefficients are expressed in terms of a single parameter Z. This 
Tschirnhausen transformation also requires only a single square root and is 
greatly facilitated by the use of polyhedral functions® and their special properties 
although Kiepert apparently did not recognize this point. Use of polyhedral 
functions leads to a modification of the Kiepert algorithm at one point in this 
step. 
(3) Transformation of the Brioschi quintic (6.1-8) into the Jacobi sextic 


10, 12g0 5 _ | 
s6 + 753 — Ars +49=0 (6.1-9) 
in which 
Z=-1/A (6.1~10). 


The roots, Soo and sr (0 <k < 4), of the Jacobi sextic (6.1-9) can be used to 
calculate the roots, yz, of the Brioschi quintic (6.1-8) by the relationship 


| 1 
Yk = V5 (Soo — Sk)(Sk+2 ~ $k+3)(Sk44 — Sk41) (6.1—-11) 


with addition of indices modulo 5. The Galois group of the Jacobi sextic 
(6.1—9) is of order 60 like the As Galois group of the quintic equations (6.1-4), 
(6.1-7), and (6.1-8) but involves transitive permutations of the six roots s.. and 
5% (O Sk <4) rather than only the five roots of the quintic equations.” 

(4) Solution of the Jacobi sextic (6.19) by the Weierstrass elliptic 
functions defined by the differential equation 6.1—2, followed by evaluation of 
the Weierstrass elliptic functions using genus 1 theta functions. 

(5) Evaluation of the periods of the theta functions corresponding to a 
particular Jacobi sextic (6.1-9). This requires solution of a cubic equation with 
coefficients simply calculable from the coefficients g2 and A in (6.1-9). A> 
simple function of the three roots of the cubic is then substituted into a special 


6F, Klein, Vorlesungen iiber das Ikosaeder, Teubner, Leipzig, 1884. 


70. Perron, Algebra, Third Edition, de Gruyter, Berlin, 1951, Volume 2, 
Chapter 5. 


The Kiepert Algorithm for the Quintic Equation 99 


infinite series called the Jacobi nome (Section 4.3).8 Klein? has described an 
alternative approach to period evaluation using hypergeometric series. !° 
Substitution of these periods into appropriate theta series then gives the roots Sco 
and sz (0 <k $4) of the Jacobi sextic (6.1~—9). 
(6) Calculation of the five roots, yg, of the Brioschi quintic (6.1-8) from 
(6.1-11) followed by undoing the Tschirnhausen transformations of the quintic 
equations in the following sequence: 

Brioschi (6.1-8) —> Principal (6.1~7) — General (6.1-4). 
Further details of these steps are given in the remainder of this chapter. 


6.2. The Tschirnhausen Transformation of the General 
Quintic to the Principal Quintic 


In order to transform the general quintic equation 


x5 + Ax4 + Bx3 + Cx2 + Dx+E=0 (6.2-1) 
into the principal quintic 
z) + Saz2 + 5bz+c=0 (6.2-2) 


the following relationships derived from Newton’s identities for the general 
quintic (6.1—4) are used: 


yx, = (6.2—3a) 
Yx,2 =A2 — 2B (6.2—3b) 
Yx43 = -A3 + 3AB -3C (6.2—3c) 
x4 = A4 - 4A2B + 4AC + 2B2 — 4D (6.2~3d) 
For the Tschirnhausen transformation of (6.2—1) to (6.2—2) use 
z=x2—uxt+y (6.2—4) 


8A. Fletcher, Guide to Tables of Elliptic Functions, Mathematical 
Tables and Other Aids to Computation, 3, 234 (1948). 

9F. Klein, Uber die Transformation der elliptischen funktionen und 
die Auflésung der Gleichungen fiinften Grades, Math. Ann.,14, 111- 
172 (1878), Gesammelte Mathematische Abhandlungen, Springer- 
Verlag, Berlin, 1923, Volume 3, pp. 13-75. 

10G, Sansone and J. Gerretsen, Lectures on the Theory of Functions 
of a Complex Variable, Woolters-Noordhoff, Groningen, 1969, Volume 
2, Chapter 16. 
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and solve for u and v in this expression. From the absence of a z4 term in the 
principal quintic (6.2-2) and equation 6.2—3a applied to (6.2—2) we get 


O = Dizy = Dixy? — udxy + Sv (6.2~—5). 
Substituting (6.2—-3a) and (6.2—3b) into (6.2—5) leads to 
5v =-Au—A2+2B (6.26). 


The absence of a z3 term in the principal quintic (6.2—2) and equation (6.2—3b) 
applied to (6.2—2) leads to 

0 = Yz2 = Sx? -— ux, + v)2 (6.2-7). 
Expanding the right hand side of equation 6.2~7 gives 

O = Sxg4 + u2Dixg2 + Sv2 — 2udx,3 + 2vdx2 —2uvdxp, —-(6.2-8). 
Substituting (6.2-3a—d) and (6.2-6) into (6.2-8) gives the following quadratic 
equation in u: 
(2A? — 5B)u* + (443 ~ 13AB + 15C)u + (2A4 - 842B + 10AC + 3B2 - 10D) =0 

(6.2-9) 

Solving (6.2-9) for u and substituting u and v into the following expressions 
gives the coefficients a, b, and c of the principal quintic (6.2—2): 


5a = -C(u3+Au2+Bu + C) + D(4u2 + 3Au + 2B) - E(Su + 2A) - 10v3 (6.2~—10a) 
5b = D(u* + 4u3 + Bu2 +Cu+ D) -E(5u? + 4Au2 + 3Bu + C)-5v4—10av (6.2—-10b) 
c = -E(u> + Aut + Bu3 + Cu2 + Du + E) - v — Sav? — Sbv (6.2-10c) 


In order to derive these equations define A’, B', C', D', and E' as coefficients 
of the u-modified quintic equation whose roots are the roots of (6.2—1) minus u: 

x9 + A'x4 + BY3 + C'x2 + D'x + E'= 

(x+uji+Aax+ult+Ba+u)3+Cx+u)2+Dat+u)+E=0 

| (6.2—11) . 

In (6.2—11) x is defined by the original general quintic equation (6.2—1) and u 
by the Tschimhausen transformation (6.2—4) and (6.2-9). ‘Then the equations 
for the coefficients a, b, and c of the principal quintic (6.2—2) can be defined in 
reverse order as follows: 
c (equation 6.2-10c): Write (6.2—2) as a product of linear factors using v 
as the variable to give 

v5 + Sav2 + 5bv +c =II(v —z,) (6.2-12). 
Substituting equation (6.2-4) into equation (6.2—12) gives 

w+ 5av2 + Sbv tc =I (xg? — uxy) = Tag (xg — u) = -EE' (6.2—13). 
However, 
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E'=E+Du+Cu2+Bu3 + Aus + 0 (6.2-14). 
Substituting (6.2—14) into (6.2-13) and solving for c gives (6.2—10c). 
b (equation 6.2~10b): Differentiate (6.2-12) to give 


544+10av+5b= 2 Tv - 2) (6.2-15) 
#j J 
= Ti xx(xe — 4) (from 6.2-4) (6.2—16) 
#j J 
i 
= Tre = od Xj — W) (6.2—17) 
, Poth 
= EE Ls — - 7 7 (6.2~18) 
J 
1 D 

But > =e 6.2—19 

xj -E ( ) 
J 
1 D'_ —_ 5u4 + 4Au3 +3 Bur +2Cu+D : 

and >) xj-u -E —(u? + Au4 + Bud + Cu? + Du + E) e220): 


Substituting (6.2-19) and (6.2-20) into (6.2~18) followed by routine algebra 
gives (6.2—10b). 
a (equation 6.2-10a): Differentiate (6.2-12) twice to give 


10v3 + Sa =— Ling —u) (6.2—21) 


fray. M2 


5 1 
k | fier ji jl ) j2¢ j2 ) 
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Vc 


1 
xe(xjp-u) ~~ uk xi-u 74 for k = j; and j2 to obtain a partial 


Multiplying 


fraction decomposition of the summand gives 


10v3 + 5a = 


Flaw» ()( Ets arcs) - Zeta 
AS J1#I2 
(6.2-23) 


The terms in (6.2—23) are evaluated in terms of the coefficients of (6.2-1) and 
(6.2~11) and using (6.2-19) and (6.2—20) as follows: 


1 _-€_C¢ 
> ee ee (6.2-24) 


Perce nC’ _C'__10u3+6Au?+3Bu +C (6.2-25) 
| Oxf — WGaj2 = w) - NEE —u) -E' E y3+Au4+Bu3+Cu2+Du +E ‘ 


y ——_!_-._ _| 2b" pre ares |) ? 
Fame xj1(xj2 —u) 7 cE io ee Ss 5a * = =~ 2) (6.2 26) 
Atha j 


Combining 
C. DD' 1 /D' D 
3 Se, [2 a e 
10v°? + 5a = ~DEE E+ E Es i (ce “z))| (6.2—27) 


=33(DD' +; ~(D'E -DE')-CE'-C'E) (6.2~28) 
Now, | 
1 OE ~ DE') = ~Du4 + (5E — AD)u3 + (4AE - BD)u2 + (3BE ~ CD)u + (2CE ~ D2) 


(6.2—29) 
Substituting (6.2-29) into (6.2-28) and using the values for C', D', and E' 
given in (6.2—14), (6.2-20), and (6.2~25) gives (6.2—10a). 
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6.3. The Tschirnhausen Transformation of the 
Principal Quintic to the Brioschi Quintic 


The objective of this step is the transformation of the principal quintic 
(6.2—2) to a Brioschi quintic of the form 

y>) — 10Zy3 + 45Z2y ~ Z2 =0 | (6.3—1) 
in which all of the coefficients can be expressed in terms of a single parameter 
Z. This Tschirnhausen transformation uses the following relationship between 
the variables in (6.2—2) and (6.3-1): 

2 = ate (6.3-2) 
This Tschirnhausen transformation is best described geometrically using 
polyhedral functions on a Riemann sphere (Section 2.5) and is closely related to 
the partition of an object of icosahedral symmetry into five equivalent objects of 
tetrahedral symmetry (Section 2.3 and Figures 2.5 and 2.6). Details have been 
presented by Dickson!! in 1930 and in a recent paper by the author? on the 
relationship between the quintic equation and, icosahedral symmetry. 

Consider a regular octahedron and a regular icosahedron represented by 
points on the surface of such a Riemann sphere so that the north pole is one of 
the vertices in each case. The polyhedral polynomials for the regular 
octahedron and icosahedron in these orientations in homogeneous form are as 
follows (Section 2.5) where x is taken to be u/v: 

A. Octahedron (O, symmetry): 


Vertices: T= uv(u4 — v4) (6.3—3a) 
Edges: ¥ = u!2 — 33u8y4 — 33u4v8 + yl2 (6.3—3b) 
Faces: W = u8 + 14u4v4 + v8 (6.3—-3c) 
B. Icosahedron (/, symmetry): 

Vertices: f = uv(ul9 + 11u5y5 — y10) (6.3—4a) 
Edges: T = 130 + 522u25¥5 — 10005u20y19 — 10005u!0y20 — 522u5v25 + 30 (6.3-4b) 
Faces: H = —u20 + 228u!5y5 — 494y10y10 _ 228y5yl5 — y20 (6.3—4c) 


11L, E. Dickson, Modern Algebraic Theories,. Sanborn, Chicago, 1930, 
Chapter 13. 
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The special symmetries of the octahedron and icosahedron lead to the followin g 
identities (Section 2.5): 

Octahedron: 10874 — W3 + y2 = 0 (degree 24) (6.3—5Sa) 

Icosahedron: 1728/5 — H3 — T2 = 0 (degree 60) (6.3—5b) 
Equation 6.3-5b plays a role in the Tschirnhausen transformation for 
conversion of a principal quintic (6.2-2) to the corresponding Brioschi quintic 
(6.3—1). 

Consider further the icosahedron on a Riemann sphere so that one 
vertex is located at the north pole. The 30 edges of the icosahedron can be 
divided into five sets of six edges each (Figure 2.5) so that the midpoints of the 
edges define five octahedra whose vertex functions t, and face functions W, are 
as follows where € = exp(2mi/5) andO<k <4: 
ty = erky® 4 2e2kyDy — Seky4y2 — Se4ky2y4_ 2yedkyy5 4 e2ky6 (6.3—6a) 
Wy = 04k y8 4 edkyTy — Je2kySy2 — JekySy3 4 Tetkyry5 — Jerky2y6 — g2kyy7 — eky8 

(6.3—6b) 
The vertex functions fy are the five roots of a Brioschi quintic equation 

DO — 10/3 + 45f2r-T =0 (6.3~-7) 
in which f and T are the vertex and edge functions of the icosahedron defined 
by (6.3-4a) and 6.3-4b), respectively. Equation 6.3~1 is a special case of 
equation 6.3—7 in which f2 = T so that t = yT/f2. 

Now consider the principal quintic (6.2-2). Express its roots as the 
polyhedral functions 


2h = (#) Wit ta We (6.3-8). 
In (6.3-8) f, H, and T are the icosahedral functions (6.34) and t% and W;, are 
functions (equations 6.3-6a and 6.3—6b) of the five octahedra formed by the 
midpoints of the 30 icosahedral edges (Figure 2.5). The right hand side of 
(6.3-8) is thus homogeneous of degree 0 in the variable pair u, v. The ability 
to express the roots of any principal quintic (6.2-2) by (6.3-8) depends upon 
the following properties of the polyhedral functions (6.3-6) which arise from 
the impossibility of any product of the icosahedral functions f, H, and T, of 
degrees 12, 20, and 30, respectively, having degrees 8, 14, 16, 22, or 28 and 


which lead to the vanishing of the z4 and z3 coefficients in the principal quintic 
(6.2—2): 
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LWe = LW, = LWA = DW? = TH2W;2 =0 (6.3—9). 

The next step is the calculation of the coefficients a, b, and c of the 
principal quintic (6.2—2) with roots zz expressed by (6.3-8) in terms of the two 
composite icosahedral functions Z and V where 


ap. 
Z= 72 (6.3—10) 
H3 
= 6.3-11 
p ( ) 
so that Z and V are related by 
5+V=1728 (63-12) 


derived from the identity 6.3~5b. This calculation uses the powers fy" and Wy 
(2 $n <4) calculated by brute force from (6.3-6a) and (6.3-6b) although only 


the first few terms of each power are required as well as the relationship 
5 
Sierk = 0 (6.3—-13). 
k=1 


By comparing the indicated terms with the indicated products of the icosahedral 
functions f, H, and T (equation 6.3-4) the equations 6.3—14 and 6.3—15 can be 
derived as follows: 


u22y2 terms: SW,? = (5)(-24)f? = -120/2 (6.3—14a) 
u?9 terms: SWy314 = (5\(-IT = -5T (6.3-14b) 
u33y3 terms: LW ,31,2 = (5)(20-18-96-22)u33¥3 = (5)(-72)7 = -360P (6.3—14c) 
u4ly terms: LW P73 = (5)(-6+3)u4}v = (5)\(-3fT = -15fT (6.3-14d) 
udly terms: SW;4 = (5)(-4)u3 lv = (5)(4)fH = 20 fH (6.3-15a) 


LW ;4t, = 0 since no product of f (degree 12), H (degree 20), and T 
(degree 30) can have degree 38. 


u42y2 terms: SWAty2 = (5)(34-16-6)u42v2 = (5)(-12) 2-H = -60/2H (6.3—15b) 
uO terms: SWy4ty3 = (5\(-DAT = -SHT | (6.3-15c) 
u93y3 terms: LWi414 = (5)(-88-16+272-60)03 v3 = (5)(-108) RH = -540RH 

. (6.3—15d) 
In addition 

d2z3 = —15a (6.3—16a) 

d2z4 = -20b (6.3-16b) 


Calculating Yzp3 and>z,4 by expanding (6.3-8) and substituting the values 
from (6.3-10), (6.3—14), (6.3-15), and (6.316) gives 


AMON Mee I SE Moin ETE RS oe Meat favat Fide, tm oe eae he ae . 
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Va = 8A3 + A2u + (72Ap24+ 13)Z (6.3~17) 

Vb = -A4 + 18A2u2Z + Ap3BZ + 27p4Z2 (6.3-18) 
Now consider (6.3—7) and its five roots tg to give 

x — 10fx3 + 45/2x —T = Ta - tg) (6.3-19) 


for all values of x. Now let x be -AT/pf2 which generates the Brioschi quintic 
(6.3~1) if y = —A/p. 
Then 

THAT + pf2t,) = APTS — 1003 n273P + 45Ap47f!9 4 fOr (6.3—20). 
However, []W, = —H2 (6.3-21) 
so that substituting the values for V and Z from equations 6.3—10 and 6.3-11 
gives 

Ve = AD — 1003 2Z + 450 p4Z2 + p5Z2 (6.3~—22) 
Thus the values of a, b, and c obtained from (6.3—17), (6.3—18), and (6.3—22) 
determine the coefficients of the principal quintic (6.2~2) corresponding to the 
Brioschi quintic (6.3—7) with the changes of variables (6.3—2). 

The equations 6.3-17, 6.3-18, and 6.3-22 obtained above can now be 
inverted so that the parameters A, p, Z, and V can be calculated for the Brioschi 
quintic (6.3~1) corresponding to a given principal quintic (6.2—2) with 
coefficients a, b, and c. First add AVb from (6.3-18) to Ve from equation 
(6.3-22) to give u2ZVa as determined by (6.3—17), ie., 

p2Za = Ab+c (6.3—23). 
Then subtract p2ZVb (6.3~18) from AVc (6.3-22) and use (6.3-23) to give 

ee (A2 — 3u12Z)3 5 (ad2 — 3bA ~ 3c)3 

Ac~p?Zb = a®(Aac — Ab2 - bc) 


Now combine equations (6.317) and (6.3—18) in the indicated manner to give 


V Coa = A3 + 216A2uZ + 9Ap2Z + 216u3Z2 (6.3—25) 


(6.3-24) 


Divide the square of the right hand side of (6.3—25) by Z and subtract the result 
from 27 times the square of the right hand side of (6.3-17) and combine pairs 
of terms related by the identity (6.3—12) to give 


V 2 
oD 8b)" (22-3223 (6.3-26). 


Now substitute (6.3-24) into (6.3-26) to give 


27a2V — 
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(Aa + 8b)2 
Pe oar ae 
Substitution of (Ab + c)/a for u2Z from (6.3—23) now gives the following 
quadratic equation for A in terms only of the coefficients a, b, and c of the 


27a? = Ac — p2Zb (6.3—27). 


principal quintic (6.2—2): 
A2(a4 + abc ~ b3) ~ A(L1a3b — ac? + 2b7¢) + 64a2b2 - 27a>¢ - bc? =0 (6.3—28) 
After solving this quadratic for A, the value for V can be found by (6.3—24). 
After obtaining 4 and V, equation 6.3-17 can be rewritten as 
(A2 + u2Z) yu = Va ~— 843 ~ 72Ap2Z (6.3-29) 

Substitution of (6.3—23) for 2Z and solving for ps then gives 

_ Vat — 8A3.a — 72A2b - 72Ac 

7 Ata+Ab+e 
In this way the four parameters A, y, V, and Z are obtained for a Brioschi 


(6.3—30) 


quintic (6.3—1) with roots (6.3-8) corresponding to a principal quintic (6.2—2) 
with coefficients a, b, and c. 

There remains the problem of deriving the Tschirnhausen transformation 
(6.3-2) relating the principal quintic (6.2—2) to the one-parameter Brioschi 
quintic (6.3—1). The functions W, corresponding to the octahedron vertex 
functions t% which are solutions of the more general Brioschi quintic (6.3~7) 
vanish at the midpoints of the faces of the octahedra which are located at the 
midpoints of the faces of the underlying icosahedron where H also vanishes. 
Hence each W, is a factor of H. In addition, transposing the term T of the 
quintic (6.3~7), squaring both sides, and replacing ¢2 by 3f gives 

1728f5 = T2 (6.3-31) 
Substituting the icosahedral identity (6.3—Sb) into (6.3-31) gives H =0, 
indicating that t,2 — 3f is a factor of H for each k and so 

H = Wre(t,2 — 3f) O<sk<4 (6.3—32). 
Substituting (6.3--32) into (6.3—-8), suppressing subscripts, using (6.3—10) for 
Z, and the relationship ¢ = yT/f2 to convert the two-parameter Brioschi quintic 
(6.3-7) to the one-parameter Brioschi quintic (6.3—1) gives the required 
relationship (6.3—2) between the variable z of the principal quintic and the 
variable y of the Brioschi quintic (6.3~1). 
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6.4. Transformation of the Brioschi Quintic to the 
Jacobi Sextic 


The objective of this step is the transformation of the Brioschi quintic 
(6.3-1) into a Jacobi sextic 

56 — 10fs3 + Hs + 5f2 =0 (6.4-1) 
whose roots can be expressed in terms of elliptic functions. The Galois group 
of (6.4—1) like that of the Brioschi quintic (6.3-1) or (6.3—7) is a transitive 
group of order 60 but now involves permutations on six objects (the roots of 
equation 6.4—1) rather than five objects (the roots of equation 6.3—1 or 6.3--12). 
If one of the roots of the Jacobi sextic is designated as Soo, the five other roots sx 
(0 <k <4) have the special form 


es (S)( yeh = = (6.4-2) 
where € is any primitive fifth root of unity. From here on we shall take € to be 
exp(21i/5). 


In order to effect this transformation the following theorem of Perron? is 
used: 
Theorem 6.4—1 (Perron): In order to find the roots of the Brioschi quintic 
DB — 10f8 + 451 -T =0 (6.4—3) 
the quantity H for the corresponding Jacobi sextic equation (6.4—1) is 
determined to satisfy the icosahedral identity 
17286 ~ H3 ~T2 =0 (6.4—4). 
If the roots of this Jacobi sextic are soo. and sz (0 < k < 4), then the roots t, of 
the Brioschi quintic (6.4—3) are 


1 
tk = | yas — Sk)(Sk+2 — Sk43)(Sk44 — Sk+1) (6.4—5). 


The ¥5 in (6.4—5) arises from the relationship 
et4e—e2—e3 = 5 (6.4-6) 
and the sign of the outer square roots can be obtained by writing the Brioschi 
quintic as 
T 


“A 10fe2 + 452 oe 
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so that only even powers of ¢ appear on the right hand side. Choice of a 
different primitive fifth root of unity for € may lead to replacement of 15 on the 
right side of (6.46) by -Y5. 


Proof: Consider a special quintic equation 
4 


uy — 1) = 1 + ayn + a2n3 +4372 +a4n+a5=0  (6.4-8) 


in which the roots have the form 

cyek + cer + c3e3k + cgetk= mye = (OSkS4) (6.4—9). 
From Newton’s identities (e.g., equations 6.2—3) the coefficients a,, (1 <m < 
5) are found to be 


a, =0 (6.4—10a) 
a2 = -5(cjc¢4 + €2€3) (6.4—10b) 
a3 = ~5(c12c3 + c22c} + ¢32¢4 + 6429) (6.4~—10c) 


ag = ~5(cpF eq + c23¢q + C33 C1 + 64303) + SCC] 2042 + 022032) - Seyc2¢3c4 : (6.4—10d) 
as = —(c4> + 62° + €3° + €4°) + 5(e13¢3¢4 + €23¢103 + 6330204 + €43C109) - 


5(c17¢22c4 + €17¢3202 + 0224203 + €37¢42C}) (6.4-10e) 
Application of equations (6.410) in the special case 
Cc, =F; c2 = 0; c3 = 0; cg =—-1/r (6.4-11) 
leads to 
4 


k ~ vet ~ =a =m+53+57 =P ~ 3) (6.4—12) 
I] re 


or if 7) — 1 is used instead of 7) 


4 
in ‘ + rek— a) = mp —5n4 + 157? - 2572 + 25-1250 (6.413) 
I] rek 
where 1250= 114+), - (6.4-14). 


Replacing 7 by —7, multiplying the resulting equation by (6.4—13), and 
changing the sign gives 
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[lr (+See 8) 
[Yr a 


= 710 + 578 + 256 + 125(1 - 100)74 + 625(1 - 100)? - 12520 (6.4-15) 
Now replace 772 by 5s/p and multiply first by (p/5)° and then by s — p to give 


(s — p) : E x 3(! + rek— xy 


= 56 — 100p3s3 - (1 — 100 + S02) p5s + 5 0% (6.4-16) 
Equation 6.4~16 set to zero corresponds to the Jacobi sextic (6.4—1) in which 

f=op3 (6.4-17a) 

H =~(1 — 100+ 502)p° (6.4-17b) 
and the roots are 

Sea = p and Sin =5( + rem — =) O<m<4) (64-18) 


Now consider the following five quantities which will be used to derive the five 
roots of a quintic inv (0 <k <4): 

Vk = (Soo — Sk)(Sk+2 — Sk+3)(Sk44 — Sk+1) (6.4-19). 
Substituting (6.4—18) into (6.4-19) and using (6.4—6) and (6.4-17a) where 
appropriate gives 


Vk 

—=-4f= 
2 ° 

6 Set ( 2429 er (- 345 edt ( A241 t] 
cakG 24 q ye* + I5r°+"3 e-*4+¢ -207 v3 er-® + § 6 sm € 


(6.4—20) 
Now consider the quintic polynomial 
4 


| ff ~ (7 - 4) =v + a4 + aqv3 + ayv2 + agv + a5 (6.4-21) 


The coefficients a, of the polynomial have the form 


The Kiepert Algorithm for the Quintic Equation 111 


in = ) > Cip3!*5i0!(1 — 100 + 502)/ (6.422) 
j 


l 


where 3m = 3i + Sj. 
Considering (6.4—20) as a special case of (6.4—9) and applying the formulas 
6.4~10 gives 


| E = (15 7 r)| =v + 30/23 + 1008 v2 + 1054v +368 +H = (6.423) 
k=0 


Now replace v by w where v =—~ (6.4-24) 


aid 
ir if 
and multiply by 2575 = (V5)5 to give 


4 = 
[lov - vg) = w5 — 20V Sfw4 + 950 2w3 — 4500V5hw2 + 50625w + 25V5(H3 - 
k=0 

1728/9) = 0 (6.4—25) 
Now replace w by ¢ where w = ¥5/2 (6.4-26) 
and divide by 2575 to give 


£10 _ 20718 + 190/215 — 900 f84 + 2025f42 + (H3 — 1728/5) = 0 (6.4—27) 
which can be written as 

(P — 10ft3 + 457202 = -H3 + 1728 (6.4-28) 
Substitution of the icosahedral identity (6.44) in (6.4-28) and then taking the 
square root of each side gives the Brioschi quintic (6.4~3). Thus 


t= | (6.4—29) 


after considering the effects of the variable changes v->w—>f with the outer 
square root arising from equation (6.4~26). 

This concludes the summary of Perron’s proof. 

The next step is to relate the coefficients f, H, and T of the Brioschi 
quintic and the associated Jacobi sextic to the invariants g2, g3, and A of 
corresponding elliptic functions satisfying the following identity (compare 
equation 4.2-19g): 
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A= 873 — 27232 (6.430) 
Write the Brioschi quintic (6.3—7) as 


10, 45. 216 
5 pars poe pees 
P+7t +5! me =0 (6.4-31) 
—1 
so that f= x (6.432) 
and T = 108s (6.4-33). 
Applying the icosahedral identity (6.4—4) and the elliptic identity (6.4—30) leads 
to the relationship 
23 
A== (ar) ~ 27932 (6.4—34) 


and the Jacobi sextic corresponding to the Brioschi quintic (6.4-31) is 


10g 2 5 
6453 1282 54 70 (6.435). 


Now let us relate the parameter Z in the one-parameter Brioschi quintic (6.3-1) 
to the elliptic invariants g2 and A. Comparing (6.3—1) and (6.4—31) indicates 
that 


= 5 (6.4—36) 
1 21623 A 

Z2 = Ae — AB => £7 = 216 (6.4—37). 
Substituting (6.4-37) in the elliptic identity (6.4—30) leads to 

yO RN et 

A178} yoez oo 
Defining V by the identity (6.3-11) then gives 
3 
ye oo 2 (6.4-39). 


Thus the elliptic invariants A and g2 can be obtained from the V and Z calculated 
in the previous section by the following simple relationships: 
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Anz (6.4-40) 
3 


2 
VVA_ 1 1 — 17282 
§2= D>. ~ 72 X Z2 (6.4-41) 


6.5 Solution of the Jacobi Sextic with Weierstrass 
Elliptic Functions 


The solution of cubic equations (Section 5.1) using the (periodic) 
trigonometric functions makes use of their period division by three. For 
example, the solution of the cubic equation 

x3 +3Hx+G=0 (6.5-1) 
can be expressed as 


xp = 2\-H cos( 2+) (0<k <2) (6.5-2) 


where cos¢ = (6.5~3). 


—G 
2V-H3 
Analogously the solution of the Jacobi sextic (6.4—1) corresponding to a 
Brioschi quintic (6.3~7) relates to the period division of the doubly periodic 


elliptic functions by five. Thus using a formula derived by Kiepert!@:13 the 
following equation will be solved (compare equation 4.2~28): 


Q' gp” g'™ glv 

o6u) 1 | @" @" pV @V 

a5) 2'3!42| g'" pV pV eV 0 (6.5-4) 
galv gov gl vil 


12K. Weierstrass and H. A. Schwarz, Formeln und Lehrsdtze zum 
Gebrauche der Elliptischen Funktionen, Springer Verlag, Berlin, 
1893, p 19. 

13, Kiepert, Wirkliche Ausfiihrung der Ganzzahligen Multiplikation 
der Elliptischen Funktionen, J. fiir Math., 76, 21-33 (1873). 


es Ai a RAL ROUEN LO CL eee 


EY te ORIEN AI TRE TOTO TTL al oe TY 


i 
i 
; 
i 
; 
H 


114 Beyond the Quartic Equation 


In (6.5-4) the entries in the 4 x 4 determinant are the first through seventh 
derivatives of the Weierstrass go function (Section 4.2) defined by the 
differential equation 

(9'" =493 - 29 - g3 (6.5~-5) 
and the Weierstrass o function is defined by 


—d2{In o(u)] 


he (compare equation 4.2—11c) (6.5—6). 


§9(u) = 


Since the zeroes of o(u) occur at u + 2ma + 2na', the determinant in (6.5—4) 


vanishes when 


2m@ + 2na' 
Umn Sa (6.5~7). 


In equation (6.5—7) 20) and 2" are the two periods of the elliptic functions and 
m and n are integers mod 5 both of which are not zero. The period division by 
five in (6.5~7) is analogous to the period division by three of the cosine in 
(6.5—2) used to solve the cubic equation (6.5—1). 

Evaluation of the determinant in (6.5—4) is found to provide a method 
for expressing the roots of the Jacobi sextic (6.41) in terms of the Weierstrass 
elliptic go function (6.5—5). The periods of the elliptic functions can be 
calculated from the coefficients of the Jacobi sextic (6.4~—1) or the corres- 
ponding Brioschi quintic (6.31) by either of two methods, one of which will 
be presented later in more detail. 

In order to evaluate the determinant in (6.5—4), Perron’ uses the 
following derivatives of 49(u) in addition to a'(u) (equation 6.5~5): 


"= 692-2 (6.5-8b) 
gp" = 129 ¢' (6.5~8c) 
@lY = 12(¢~')2 +12¢9 ~" (6.5—8d) 
QY =369'e"+ 120 0" (6.5--8e) 
ol = 36(¢")2 + 48 0' fo" +129 @lV (6.5—8f) 


eV = 1209" 9"+ 60¢9'plV +129 pV (6.5-8g) 
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This sequence of derivatives of go resembles Painlevé chains discussed 
elsewhere by the author.'4.15 

Several steps will now be taken in order to simplify the determinant in 
(6.54) before its explicit evaluation. First subtract 12 fa times the first column 
from the third column and subtract 12 9 times the second column and 12¢' 
times the first column from the last (fourth) column after substituting the 
formulas for the derivatives (6.5—8c to 6.5—8g) to give 


g’ g" 0 3 0 
p" ep" Iw"? pio — | i 
ge" ply 36 9' (" 36( (0")7+36 90" "" =0 (6.5 9). 


pV oY 36("2+489'—" 1209'9"+480' pV 


Now perform the same operation with the rows and suppress factors of 12 in 
the third and fourth columns to give 


{9 " 0 0 
gQ" go" (g0')? ) '¢" 2 é 
0 12( 4a’)? 3 ' fo" 3( 9") +3 ga' fo" = 

0 Upp" 39"2+4e'~p"-Re(py l0p"—"+49'—'V-2499'~" 


(6.5—10) 


Now substitute the expressions (6.5—8c) and (6.5-8d) for go" and gly, 
respectively, and suppress factors of three in the third and fourth rows to give 


Qo a" 0. 0 
9" Rpg (g'* 290 2" 7 2 
0 4( o')? pp" (g0")?-+12 9( @'? = 0 (6.5 1 1). 


0 82'p" (p"*+2p(VY 8eQe'p"+16(o'P 


Multiplying out this determinant gives the following polynomial which is of 
degree 12 in go after considering (6.5—5) and (6.5-8b): 


14R. B. King, Systematics of Strongly Self-Dominant Higher Order 
Differential Equations Based on the Painlevé Analysis of their 
Singularities, J. Math. Phys.,27, 966-971 (1986). 
ISR. B. King, Painlevé Chains for the Study of Integrable Higher- 
Order Differential Equations, J. Phys. A.: Math. Gen., 20, 2333-2345 
(1987). 
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[(@")? - 12 0(e' 8 - 16(9')4 e"[(e")? - 12 e( 7] - 64(e')8 = 0 (6.5-12). 


The 24 values of Um» in (6.5—7) relate to the 12 roots 9m,» of (6.5—12) in pairs 
by the relationship 

9 —-m,-n = P mn (6.5—-13) 
using mod 5 arithmetic on the subscripts. 

Next Perron evaluates — 


4 4 2 
Ymn = P2m,2n- Pmn= P —) - 9 ee eee (6.5—14) 


and uses the following formula from Weierstrass: 


Bij e(2u) = esos PQ) 2"w) ~ gu)” (6.5-15) 
dye ee 460"(u)* : ; 


Substituting equations (6.5~8c) and (6.5—14) into (6.5—15) gives 


( “" )2 - 12 1 2 
Yonn = §2 mn Lake m,n) (6.516) 
4( mn) 
for all integers m and n mod 5 excluding m =n=0. Substituting (6.5—16) into 
(6.5~12) and dividing by 64(,a')® gives the cubic equation 
y3— p"y-(g')?=0 (6.5-17). 


Also solving (6.5-16) for (")2 gives 


(@") = (y+ 12) e'? (6.518). 
Combining (6.5-17) and (6.5—18) gives 
(90")? = (4y + 12¢)03- py) (6.5-19). 


Substituting the expressions for (4a')? (equation 6.5—-5) and go" (equation 
6.5—-8b) into (6.5~—17) and (6.5—19) gives the pair of equations: 

y3 — (6 0? -7 go)y - (403 — g29 — 83) =0 (6.5-20a) 

(6 9? —7 g2)? = (4y + 120)3 — 6 9y +7 gay) (6.5-20b) 
Perron’ now proceeds further by introducing a new variable z where 

fa =4(z—-y) (6.5—-21). 
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Substituting (6.5—21) into (6.5—20a) and (6.5—20b) and ordering according to 
powers of z gives 
z3 — (3y2 + g9)z — 223 =0 (6.5—22a) 
Qz4 — 6(Sy2 + 99)z2 + Sy — 2goy? + 297 =0 (6.5—22b). 
Solving (6.5-22b) for z2 by using the quadratic formula gives 


322 = Sy2 + 99 + 2yV5y2 + 322 (6.5—23). 
Writing (6.5—22a) as 

2(22 — 3y? — go) = 283 (6.5-24) 
and squaring gives 

22(22 — 3y2 — go)? = 4g3? (6.5-25). 


Substituting (6.5-23) for z2 into (6.5~25) gives 


(Sy2 + go + 2yV 5y2+ 392 )(—4y? — 2g2 +2yV Sy? + 3g2)2 = 108 gs? 
(6.5—26) 


after multiplying both sides by 27. Now multiplying out the expressions on the 
left and dividing both sides by 4 gives 


Sy — 2ySV Sy2 + 3e2 +2923 = 27823 (6.5~27). 


Note that the powers y” (1 <n < 4) outside the radical conveniently cancel 
during this process. Substituting the elliptic identity (6.4-30) into (6.5-27) and 
rearranging gives 


5y6 + A= 2yV5y2 + 322 (6.5-28) 


which after squaring and combining terms gives 

5yl2 — 12g9y10 + 10Ay® + A2 =0 (6.5—29). 
The 12 roots of equation (6.5—29) are given by equation (6.5~14). 

Perron’ obtains the Jacobi sextic (6.4-35) from (6.5—29) by introducing 
the change of variable 


Lb ITA OENE RIO NIM SEAS RL ILLS TELE IES ALN, POOLE RO IER, OER E EBRD CIA EAR TRAM hate EAA haa PD recat ENTS AAR LY SPAM OCA ALOE IIIS FAS IBESS, 2 =8 SORE SES 
SN Mani es SCD o EE RMT aed ae NI Thee tee NER Te . . tee : we esate nate arr a ee ewes . z 


Sty kaye arose 


118 Beyond the Quartic Equation 


yes (6.530) 


oe 


6 
and multiplying both sides by oo . Because of the nature of the transformation 


(6.5-30) and the roots (6.5—14) of equation 6.5-29, Perron’ concludes that the 
roots of the Jacobi sextic (6.4~35) must be the Weierstrass elliptic functions 


1 
oy RS ei ee ae etree eee Ie eens oe 6.5-31). 
ie (ene + LM : ene ‘ 2ne) ( ) 


5 5 


6.6. Evaluation of the Weierstrass Elliptic Functions 
Using Genus 1 Theta Functions 


The analysis up to this point provides a method for expressing the roots 
of a quintic equation by the expression (6.5-31) involving Weierstrass elliptic 
functions of the periods 2@ and 2m'. The method for evaluating the 
Weierstrass elliptic functions (6.5~31) by means of theta functions of genus 1 
as given by Kiepert? will now be discussed. Such theta functions (Section 4.3) 
use the argument q defined by 


qs enn“ | (6.6-1). 
@ 


Equation (6.6—1) indicates that g depends upon the period ratio w'/@ of the 
elliptic functions. The rapidly converging series used to evaluate the relevant 
theta functions converge when lql < 1. 

In order to avoid a factor of the 24th root of unity, Kiepert? writes the 
six roots of the Jacobi sextic (6.4—1 or 6.4-35) as 


— @ (42) ent 
5 )7 OVS 
Ug a ee ey 4). 


2@' + 48k@ 4@' + 96k@ 
of 5 )- o( 5 
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Because of the double periodicity of the elliptic functions, equations 6.6-2 and 


6.6~3 are equivalent to equation 6.5-31. 


Kiepert3 converts equation 6.6—2 to the corresponding theta function by 
using sigma functions (6.5—6) for which the following relationships apply'®: 


EE px (aay | ey ame eT Pe 
«neo (SVS ar ae) 6 


in which 
Ga) wey _ F(@) a miu 
S@) = Ca) = ay He oP Oa) 
Also 
Vl vtu 
OW OO) = rsa? 


by (4.2—26a). In addition 


o(u + 2m@ + 2n) = 
(—1)mntortng(ujexp[(2m + 2nG)(u + tion) 
by equation (4.2~27b) and 


Substituting (6.6-6) into (6.6—2) gives 


45) (“2 
2) (2) 


(6.6—5). 


(6.6-6) 


(6.6—7) 


(6.6-8). 


(6.6—9) 


16K, Chandrasekharan, Elliptic Functions, Springer Verlag, Berlin- 


Heidelberg, 1955. 
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From (6.6—7) 
6a 4a 2 
o(F)= o(- or 20)=0 (Pe( 22) (6.6-10) 
so that 
20@\ (4@ —~20@ 
Soo = o(F)o(P)exn( ) (6.6—-11) 
Recall ¢ = ex0( 5") (6.6-12) 
so that 
4@ 
r(Bo)- Ve andr (= Sa w)= € (6.613) 
and (6.64) for the cases of interest becomes 
20\ 2a Co) . /m 1-qtie 1- grie4 
“ \=3 ene (FE 5 )sin(5) ‘ i ae se ae i (6.6—14a) 
40) 20 [ 1- gtie2  1- gig 
ces )=4 eX xp (“$5 5 )H(F) ic aa) . peers (6.6~—14b) 
so that 
1 qq! 
sin GL ons (6.6-15) 


For further simplification Kiepert? converts the infinite product (6.6~15) into an 


infinite sum using the relationships 


an (2) Yale ee f Dewees? 


and 


(6.6—16) 


The Kiepert Algorithm for the Quintic Equation 


4 sin() sin(*) = 5 


Then 
3 oO 
Vino = 22 Geen 7) 
om a Qi 
{a T= aa 


Substituting (6.6-16) into (6.6—18) Kiepert obtains 


oo 


SY (-Diqh(Gi+1)7/12 
Te _N5 isco 
6 oo 
VAY Cryig(it 212 
[=—~co 


Define 


B= oA YS (Digit D712 
i= 


=O 


Then 


{Seo a Si(-Digh6F 7/12 


j=—00 


In an analogous way Vs; from (6.6—3) becomes 


20@'+48ka¥ (4a'+ 96kaY 
4 5 o 5 
Vs, = 


20' + 48ko@ 6a@' + 144ka@ 
i é Al 5 


Using equation (6.6—7) | 
6a' + 144ka@ 4a' + 96k@ 
(iyi) of pst 


oe exp es + 48k0( 5 


and 


+2w+ 48ko)= 


@' + 24k@ 
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(6.6—17) 


(6.6~18) 


(6.6-~19) 


(6.6-20). 


(6.6—21). 


(6.6-22). 


(6.6—23) 


eK 8 


ie Women EN RRS Ct cetInE LIS Te rer CU ID pete Cane enable 


Oat thi my mt ere mA Oe AI URI re ermangentag a 


ROL A, IRI AT AR OW REL A IROL FRAT RL CENA MON ARNE IEA ate LAO OM A ENS AUN Ne tee Sect eee Ma Ine tS Oe 
meme, erento nese <n emermeeery eer er esenaarannintyintirtartwr vn yneoa smeaminaee Rees Se oy Ea es aa 
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sx = exp -e¢ + asky(2 +2) (2 + Asko (40 + 96k) 


(6.6—24). 
Then 
pelha 
5 
20'+ 48kw\ (20 ye > Vgel2k 
20-5812) 2) up| 20+ 2440))| — ge 
| 1 — q2i#2/5~g24k 1 — g2j-2/5¢-24k 
“A ca a oe) ae (6.6—25a) 
and 
5 1 
Vg? e24k — 
5 
4@' + 96k@\ (2@ ae 9 gq? E2Ak 
, merc J= Jexp Px @' + 24k@) + 


“4 if 1 - qi 1 - q2i (6.6~25b). 


By substituting (6.6—25a) and (6.6~25b) into (6.6—24) Kiepert3 concludes 


aa | 1 — €24kig2il5 


15 
Vqva Jl 


S(T fekGi+ 1)? g(Gi+ 1)7/60 


—— 


6 oO 
VAS ryigoit 7/12 


[=---co 
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oo 


=¢ ¥ (CA yiet Gi+ 1)? g(6i+1)/60 (6.6-26) 


j= e5 


in which B is defined as in equation 6,6-20. The roots of the Jacobi sextic 
(6.41) can thus be expressed as quotients of two odd theta functions of genus 
1 through (6.6-19), (6.6—20), and (6.6~26). There remains the problem of 
determining the periods 2@, 2@', and therefore g through (6.6—1) corres- 
ponding to a given Jacobi sextic (6.4—1) defined by the parameters f and H 
corresponding to the vertex and face functions of the icosahedron used to 
generate the corresponding Brioschi quintic (6.3-7). 


6.7. Evaluation of the Periods of the Elliptic Functions 
Corresponding to the Jacobi Sextic 


Two methods are available for determining the periods corresponding to 
a Jacobi sextic (6.4—1): 
(1) Use of hypergeometric series?-!0 based on the following differential 


equation: 
dz 72 7 dz. 2 
rane (3- a) T-14a = 9 (6.7-1) 
where 
3 
J eee Ae 


The details of this method are beyond the scope of this book since they require 
consideration of the properties of hypergeometric series. 
(2) Use of the Jacobi nome (Section 4.3). This method is the one that was 
used in the microcomputer program‘. and is discussed in greater detail in this 
section. 

Determination of periods with the Jacobi nome uses the relationship 
between Jacobi and Weierstrass elliptic integrals (Section 4.1). Thus consider 
the Jacobi elliptic function!” 


ITA. W. Erdelyi, W. Magnus, F. Oberhettinger, and F. G. Tricomi, 
Higher Transcendental Functions, McGraw-Hill, New York, 1953, 
Chapter 13. 


POT ee Pe ee DN SO a a 
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w=snk (6.7-3) 
with modulus k defined by inversion of the integral 


Ww 


a. (6.7-4) 
V1 — x2)(1 — k2x2) 


which has the square root of a special quartic polynomial in the denominator. 
Now write the differential equation 6.5—5 for the Weierstrass elliptic function in 
an analogous manner so that 

w= gu) (6.7—5) 
and 


Ww w 
Sieh a ee (6.7-6). 
V4x3 — gox — 83 V(x = e1)(x — €2)(x - €3) 


The modulus & and complementary modulus k' can then be expressed"? in 
terms of the roots e], e2, and e3 of the cubic polynomial of equation 6.7—-6 by 
the relationships 


K2 7. and (k')2 = 1—k2 er (6.7-7). 


Now calculate 


bone. Vel — €3 Seer 
L =o 7 aay Oe (6.7-8). 


Then a value of g (equation 6.6~1) corresponding to a set of roots e}, e2, and e3 
of the cubic equation by using the Jacobi nome (Section 4.3) 
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8 


5 9 13 4j+1 
a=(z)+2(3) +12(7) +150 (7) t.. 2 2g; (=) (6.7-9) 


in which the coefficients q; form the series® 


1; 2;. 15; 150; 1707; 20,910; 268,616; 3,567,400; 48,555,069; 
673,458,874; 9,481,557,398; 135,119,529,972; 1,944,997,539,623; 
28,235,172,753,886 


Thus the value of g corresponding to a particular Jacobi sextic can be obtained 
by the following sequence of steps: 
(1) The cubic equation in (6.7-6) is solved after obtaining its coefficients g2 
and 23 by means of equations 6.4—30, 6.440, and 6.441. 
(2) The roots ¢}, 2, and e3 of (6.7-6) are substituted into (6.78) to give 
L. 
(3) The value obtained for L is substituted into the infinite series (6.7—9) to 
give q. 

Equation 6.7—8 poses some difficulty since for a given ordering of the 
roots €}, €2, €3 of the cubic equation in the denominator rd equation 6.7-6, 
there are four fourth roots 5 Per and four fourth roots lees: The 16 
possible combinations of these two fourth roots reduce to four possible values 
of L when the quotient (6.7-8) is taken. In addition, there are six permutations 
of the three roots e1, 2, and ¢3 leading to (4)(6) = 24 possible q values for a 
given cubic equation. In general, half of these possible q values have lq! > 1 
and therefore are unsatisfactory since the series for the corresponding theta 
functions do not converge. The computer can check the remaining 12 possible 
values of g as well as the apparent six roots of the Jacobi sextic by investigating 
whether the product 


4 
(S — Soo) T](s — Sx) (6.7-10) 
k=0 
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agrees with the original Jacobi sextic discarding any values of q with lql <0 
which give incorrect roots. The following ambiguities also arise after 
substituting the value for g obtained from (6.7—9) into the theta series formulas 
6.6-20, 6.6-21, and 6.6—26: 


(1) The three values of (Sa) (see equations 6.6-20 and 6.6—21) differing 
by factors of exp(27i/3) need to be checked. 

(2) The q factor in equation (6.6—26) is obtained by using the fifth root of g 
in an algorithm (i.e., the 60 in the denominator of the exponent in equation 6.6— 
26) to determine the infinite sum in the equation (6.6—20) for B. Thus all five 
of the fifth roots of g must be checked. 


: 6 ; 

(3) These 15 possible choices of 4 and ( 6a)" are all used in the 
computer algorithm‘. and the apparent six roots of the Jacobi sextic have been 
checked using (6.7—10). Choices giving incorrect roots were rejected. 


6.8.. Undoing the Tschirnhausen Transformations 


Equations 6.6~20, 6.6-21, and 6.6—26 express the six roots of the 
Jacobi sextic (6.4—35) in terms of genus 1 theta functions expressed as rapidly 
converging infinite sums. In order to convert these roots to the roots of the 
original general quintic (6.2—1), the various Tschirnhausen and other 
transformations need to be undone in the following sequence: 


Jacobi Bnioschi Principal General 
sextic quintic quintic quintic 


(6.4-35) (6.3-1) (6.2—2) (6.2~1) 
In order to obtain the roots yg of the Brioschi quintic (6.3-1) from the roots of 


the Jacobi sextic (6.4-36), the following two equations are used in order to 
assure the correct selection of the sign for the outer square root in (6.4—5): 


1 
yee = 5 (Soo ~ Sk)(Sk+2 — 5k43)(Sk44 — See) (6.81) 
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21623 

et ees (6.8-2) 
Be 10 45 ne 
(1407)? + Aye? + 


The roots zg of the principal quintic are then obtained from the roots yx of the 
Brioschi quintic by using (6.3-2). Finally the roots x, of the general quintic 
(6.2-1) are obtained from the roots zz of the principal quintic (6.2—2) by the 
equation 

E + (zi - v)(u3 +Au2+Bu+C)+ (ze - v)2(2u + A) 


cei u4+Aur+Bu2+CutD + (zp - v)(3u2+2Au+B) + (zg - v)? (6.8-3) 


In order to derive equation (6.8—3) first write equation (6.24) as 


(x ~u)2 = (z—v) —~ u(x —U) (6.8—4). 
Iteratively, find equations of the form 
(x — uy = Pru, Z- v) + Qm(u, z — v)(x — Xd) (6.8-5) 


for 3 <m<5 where P,, and Q,, are polynomials. Substituting these equations 
(6.8~—5) into (6.2—11) leads ultimately to a linear equation in (v — u), which 
when solved and simplified yields the desired equation 6.8-3. 


wNe 


Chapter 7 


The Methods of Hermite and Gordan for 
Solving the General Quintic Equation 


7.1 Hermite’s Early Work on the Quintic Equation 


The previous chapter discusses in detail the Kiepert algorithm for 
solving the general quintic equation since this is the method that we have tested 
on the computer. This chapter summarizes the earlier methods of Hermite and 
Gordan for solving the general quintic equation starting with the original 1858 
work of Hermite? in this area. 

Hermite first looked at the cubic equation 

x3 ~3x+2a=0 (7.1~1) 
in which the constant term a is represented by the sine of an angle & so that the 


roots of the equation separate into the three functions 


2 sing, 2 sin, 2 sine (7.1-2) 
He then considered an extension of this approach to the Bring-Jerrard form of 
the quintic equation 
x -x-a=0 (7.1-3) 


realizing that the general quintic can be transformed to the Bring-Jerrard quintic 
by solving equations of degrees no higher than three. In this connection 
Hermite! recognized the role that elliptic transcendents can play in the solution 
of the Bring-Jerrard quintic (7.1-3) analogous to the role of the trigonometric 
functions in the solution of the cubic equation (7.1—1) (see Section 5.1). 

Let K and K" be the periods of the elliptic integral 


oa ee (7.1-4) 
1 — k*sin2o 


i.¢e., 


1C, Hermite, Sur la Résolution de I'Equation du Cinquigme Degré, 
Compt. Rend., 46, 508-515 (1858). 
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t/2 n/2 
d 
| ee ee and = K'= ~ (7.1-5) 
V 1 — k2sin2o V 1 — (k')*sin2@ 
In addition 
q= exp{- * ) (compare equation 6.6—1) (7.1-6a) 
and k2 +(k')2=1 (7.1-6b). 


The fourth root of the modulus k is related to g by the infinite series 


2m?+m 
qe yg ee (7.1-7) 


da 


Define the following: 
q = eitt (7.1-8a) 


Vk = p(t) = \ | 82(0) (compare equation 4.3-14a) (7.1—-8b) 
83(0) 
VE = w(t) =~ | sO (compare equation 4.3-14b) (7.1—8c) 
3 


The variable t = w/a is the same as in Section 4.3 but is designated as by 
Hermite! and 19th century authors discussing Hermite’s work. 
The functions @ and y have the following properties: 


8(t) + w(t) = 1 (because k? + (k')? = 1) (7.1-9a) 
onl 

oa isa .1—9b) . 
47) y(t) (7.1-9b) 

1) = exp(Z)\ 2D 71-9 
o(t+l) x08 (7.1-9c) 
(ttl) =~ (7.1-9d) 
w(t) 


Now let n be a prime number and define u and v as follows: 
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V= @(nt) andu = Q(t) (7.1—10) 
The parameters u and v are linked by an equation of degree n + 1 known as the 
modular equation, the roots of which have some special properties. Now let € 
be 1 or —1 depending on whether 2 is a quadratic residue or not a quadratic 
residue with respect to n, respectively. The n + 1 roots u of the modular 
equation have the form 


Ep(nt) and @ (a) (7.1-11) 
where m is an integer taken modulo n. Thus the modular equation of the sixth 
degree (i.e., 2 = 5) has the following form: 

u® — y6 + Sy2y2(y2 — y2) + duv(1 — u4v4) = 0 (7.1-12) 
whose six roots have the form 7.1~11 where n = 5 and m = 0, 1, 2, 3, and 4. 

The modular equation of the sixth degree (7.1—12) can be related to the 
Bring-Jerrard quintic (7.1-3). Thus consider the following function of the six 
roots of equation 7.1—-12: 


(1) = Reo i ofS) e(*S°) ae CS) be") i CI ~13) 


The five quantities B(t), P(t + 16), O(t + 32), O(t + 48), and B(t + 64) are 
the roots of a quintic equation of which the coefficients are rational in @(t), 
namely 


D5 — 2000 94 (t)yl6(1)@ — 1600 V5 o3(t)y!6(a)[1 + @8()] =0 = (7.1-14). 


This can be converted to the Bring-Jerrard quintic (7.1—3) by the substitution 


4 
& = ¥2453 o(ayw4(t)x (71-15) 
leading to 
8 
Pere een 9) ae (7.1-16). 


4 
V 552d A(x) 
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Thus in order to determine the roots of the Bring-Jerrard quintic (7.1—3) by the 
function ®(t), it is necessary to determine t or rather ((t) corresponding to the 


following: 
__2(1+98@) 
4 
V¥55¢92(cy4(0) 


This relationship is the basis of Hermite’s method for solving the general 


(7.1-17) 


quintic equation. 

The initial step of Hermite’s method is the Tschimhausen transformation 
of the general quintic equation to the Bring-Jerrard quintic (7.13), a process 
which is difficult (Section 3.3) although it requires the solution of equations of 
degrees no higher than three.2 A parameter A is then defined by the following 
equation: 

4 


av55 


A=—5 (7.1-18) 


The corresponding elliptic modulus k (equations 7.1—4 and 7.1-5) can be 
obtained by solving the quartic equation 


IA + A2k3 + 2k2 -A2k +1=0 (7.1-19). 


berawe re (71-20). 


Then the roots of equation 7.1~19 are the following: 


ro a+271 TO 3 —O 
k= tanz, tan—q—, tan, tan (7.1-21) 


The roots of the Bring-Jerrard quintic are then determined by the following 
equation (i = 0, 1, 2, 3, 4): 


2A. Cayley, On Tschirnhausen’s Transformation, Phil Trans. Roy. Soc. 
London, 152, 561-578 (1862); Collected Mathematical Papers, 
Cambridge Univ. Press, 1891, Vol IV, pp. 375-394. 
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© 16i | 
ype he FO) (7.1-22) 


qt 
4 
¥24539(t)y4(t) 


This formula is a more complicated analogue of the formula 5.1—-10b in Section 
5.1 for solution of the general cubic equation using trigonometric functions. 

Hermite also considered the problem of the numerical calculation of 
these roots noting that the infinite series defining @(t) and w(t) (equations 7.1— 
7) converge extremely rapidly even if g has an imaginary component (compare 
Section 4.3). The quantities @(t), &2(t), and ©3(t) can be estimated by the 
following infinite series in which Q = q1/5: 


8 

(1) = ¥235 YO3(1+0-02 + G3 - 805-906 +...) (7.1-23a) 
4 

2(1) = 235-103 (1 + 20 — O2 + 304 — 1805 - 3306 + 1497 + ...) (7.1-23b) 

3(t) = ¥2953-V0 (1 + 30 — 203 + 604 - 2405-79054...) (7.1-23c) 


Note that the series for ®(t), ®2(t), and @3(t) are missing the powers of Q of 
which the exponents mod 5 are 4, 3, and 2, respectively. The effect of 
changing T to T + 16m in the arguments of ® has the effect of multiplying Q by 
the diverse fifth roots of unity. 


7.2 Gordan’s Work on the Quintic Equation 


The algorithm by Hermite! for solution of the general quintic equation 
requires the difficult Tschirnhausen transformation of the general quintic 
equation to the Bring-Jerrard quintic equation (7.1-3). The Kiepert algorithm 
avoids this difficulty by using the properties of polyhedral functions to 
transform the principal quintic equation 

25 + 5az* + 5bz+c=0 (7.2-1) 
to the one-parameter Brioschi quintic equation 

y5 — 10Zy3 + 45Z2y ~ Z2 = 0 (7.2-2) 
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which can then be solved using elliptic functions (Chapter 6). In addition 
Gordan? described in 1878 an alternative method based on invariant theory 
(Section 2.6)4:5 for solving: the principal quintic equation which also avoids the 
difficult Tschirnhausen transformation to the Bring-Jerrard quintic equation. 

In order to set up his method Gordan? represents the 120 symmetry 
operations of the icosahedron by the transformations in Table 7~1 of the homo- 
geneous variables y and y2 (corresponding to u and v in Section 2.5 so that z 
of the Riemann sphere is yj/y2). The icosahedral functions (compare Section 
2D) 


f= yiy20110 + 11y15y25 — ya") (7.2-3a) 
T = y130 + 522y1?5y95 — 10005 yi20y9! — 10005 y1!0y20 — 522 yi y27° + y2°9 
(7.2~3b) 


H =—y120 + 228 yy15y5 — 494 yj !0yo10 — 228 yy5yg!5 —y220 (72-3 c) 
are not altered by the 120 permutations in Table 7-1. 


Table 7-1: The 120 Icosahedral Permutations of the 
Homogeneous Variables y; and y2 


{u,v = 0, 1, 2, 3, and 4; € = cos +i sin 


YI y2 
+ eV/2y, te-V2 yo 
Se Pe 
tel (e+e4)e-W2y1 +22} {fH Pfevy1~(ere4)eV2y2] 
e2-e3 = e223 
eh/2[e-vPy-(ere4)eV/y2] cHef(eretyeW2y +e/?y2] 
gl<3 7 e2-¢3 


a aoe enna 


3p, Gordan, Uber die Auflésung der Gleichungen vom fiinften Grade, 
Math. Ann., 13, 375-404 (1878).. 

4). H. Grace and A. Young, The Algebra of Invariants, Cambridge, 
1903. 

50. E. Glenn, A Treatise on the Theory of Invariants, Ginn and Co., 
Boston, 1915. 
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The substitutions in Table 7-1 also form an icosahedral group if a 
complex root of unity other than € is used. If e4 is used, then the new 


substitution system goes back to the original one if ~y2 and y, are substituted 
for y; and y2, respectively. However, use of €2 generates a new set of 


substitutions represented in Table 7—2 by the variables x1 and x9. 


Table 7-2: The 120 Icosahedral Permutations of the 
Homogeneous Variables x; and x2 


{u,v = 0, 1, 2, 3, and 4; € = cos 2n isin aa 


5 
xX] x2 
+ eYx] te-Vx9 
= eYx2 + e-Vxy 
teh [(e2+e3)e-Vx] +e¥x9] ef He Vxie7+23)eVx2] 
ete 7 et_¢ 
tel [e-Vx1-(e2+e3) eV x9] pf M(e2+e3)eVx te¥x2] 
ete 7 ete 


Gordan? now seeks the lowest degree homogeneous entire function of 
¥1,¥2,¥1,X2 which remains unchanged when yj and y2 are subject to the 
permutations in Table 7~1 and x1 and x2 are subject to the permutations in Table 
7-2. This function turns out to be 

§ = yitxi2x2 + yi2yaxo3 + yiy22x13 — yo3xix22 (7.2-4) 
This function is designated in this book by € rather than the f used by Gordan to 


| 
es avoid confusion with the f used for the icosahedral vertex function (e.g., 
| 


PELE NEE Lei Mork ME 


equation 7.2—3a). 
Gordan next derives a system of covariants of & by using a generalized 


version of the transvection process (Section 2.6) leading to the following 
functions in x1,x2,y1,)2: 
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Degree 8: 
@ = 4(E E11 = y1401123 — yiByox14 — 3y12y22x 42x22 + yiy23x2t + yotx13x2 
(7.2—5a) 
Degree 10 
y = 12(E,@)1,1 = yi5(415+x25) — lOyy4yox13x22+ 10y 13 y22x1x24+ 
10y12y23x14x2 + 1Oyiy24x12x23 + y25(—119+22°) (7.2-5b) 


The functions @ and y, like the function € from which they are obtained by the 
indicated transvections, are unaltered by the 120 icosahedral substitutions 
(Tables 7-1 and 7-2). 

Gordan} now uses these functions of icosahedral symmetry as the 
coefficients of a principal quintic equation as follows: 


19 + SEX? — Sex — y= 0 2-0) 
A root of this equation can be expressed in the following form: 

X =~yi xy + x2) + yor — x2) (7.2-7) 
The other roots then have the form 

Xv =—eYxyy1 + €2¥x1y2 — EF¥xoy) —EWVxgy2 (7.2-8) 


where € is a complex fifth root of unity. 

The roots of equation (7.26) can also be related to the icosahedral 
polynomials f, H, and T (equations 7.2—3) as well as the polynomials fy and 
Wy of the five suboctahedra of the icosahedron defined as the midpoints of its 
edges (compare Section 6.3) by the following polynomials in x1 and x2 (or 
analogously y; and y2): 


ty = €3Vx19 + 2e2Vxy 5x9 — SeVxy 4x22 — Se4Vxy2x94— 2e3 Vx 1x95 + €2¥x26 
(7.2—9a) 
Wy = 4 Vx 18 + E35 ¥x 17 x9 — Te2Vx 6x92 — JeYx 15x23 + Te4¥x 13x95 — 
Te3 Vx 12x96 — €2¥x 1x97 — E%x28 (7.2—9b) 


The parameter c is obtained by solving the following quadratic equation: 


= 4_ 3 ; 
(Ey — 8 92)c2 — (E29 — Myy2)e +A = 89" + SOW) _ 9 (7.2-10) 
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Equation 7.2-10 plays an analogous role in the Gordan algorithm to the 
quadratic equation in A (equation 6.3—28) used in the Kiepert algorithm. The 
parameter c is the value of a function that can be derived from € and @ by 
further transvection processes and thus is invariant under icosahedral 
substitutions (Tables 7—1 and 7-2). The quantities €, @, y, and c are then used 
to calculate the parameters A, B, C, and D by the following equations: 


_ 8&9 + 3cew (7.2-11a) 
E2 + 3cp 
—Ry2 
pga ach (7.2-11b) 
E2 + 3co 
aon 2 
Ca 8er + Hoy + 3cy* (7.2-116) 
E24 3co 
2 
p = ABA + 3cAB + 3068 (7.2-114) 


? 


The roots of the principal quintic (7.2—8) have the following form in terms of 
the polyhedral functions of the icosahedron and its five suboctahedra: 


WS Gi Wy - ta WyWy (7.2-12) 


Note the similarity of equation 7.2-12 to the analogous equation (6.3—8) in the 
Kiepert algorithm. 

Gordan? obtains an equation analogous to the Brioschi quintic (6.3-7) 
by calculating its variable u from the equation 


3% + AZ 
AX +BZ 


u=—VB (7.2~13) 
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An analogue in Gordan’s method to the Brioschi quintic of Kiepert’s algorithm 
then has the form 


Pe eee Cee (7.214) 


vp 


Equation (7.2-14) can then be related to the Brioschi quintic (6.3—7) by the 
change of variables t = uf. 

Gordan’s method for solution of the general quintic equation thus first 
uses the standard Tschirnhausen transformation for conversion of the general 
quintic equation to the principal quintic (Section 6.2), which is now expressed 
as equation (7.2-6). His method can then deliver a Brioschi-like quintic 
(7.2-14) similar to the portion of the Kiepert algorithm discussed in Section 
6.3. In order to obtain the actual roots of the principal quintic (7.2-6) the 
elliptic modulus k? is calculated from the following algebraically soluble sextic 
equation in k?: 


2 43 Di 3 
_ a i )? 4A oe (7.2-15) 
The solution of equation 7.2—15 to determine the elliptic modulus k2 requires 
solution of a cubic equation just like the Kiepert algorithm requires solution of 
the cubic equation in equation (6.7—6), namely 
4x3 — gox — 23 =0 (7.2-16) 
from which the modulus k2 is obtained by the quotient 


q £2 —-€3 a 
k area | (7.2-17) 


where é1, €2, and €3 are the three roots of the cubic equation 7.2—16. 

The remainder of Gordan’s solution of the quintic equation uses many 
of the methods of Hermite (Section 7.1). Thus the following relationship is 
used to obtain g and more significantly w: 
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2m2+m 
E-ow= Gee | (7.2-18) 


de 


Then the roots of a Bring-Jerrard like quintic are obtained from the equation 
derived by Hermite, namely 


= 213 Ye 9(5a) + o(2™| 
of Se) - ‘ (exes) 


es 5 


| 
x [9 (exigee) 9 (Hse) 


(7.2-19) 


The roots of the original principal quintic (7.2—-6) can be obtained by substi- 
tuting the roots Ay and the elliptic moduli & and k’ into equation 7.2~12 leading 
to 


no Ak 21+k2)hy | 4Vk ae 
ee ESSE og ON ERY) 


A(1+14k24+k4)| Vk QhyVk+k2+1 
_ Dk2(1-k2)2 BVk hy3+2khy (7.2-20) 
A(1414k2444)(1-34k2444) | 14k2 VK (2hyVk +k241) 


The roots of the general quintic equation can then be determined from the roots 
of the principal quintic (7.2-20) by reversing the original Tschirnhausen 
transformation from the general quintic to the principal quintic by equation 
(6.8—3). 


Chapter 8 
Beyond the Quintic Equation 
8.1 The Sextic Equation 


The theory of the general sextic (sixth degree) equation was discussed 
by Cole’ in 1886 within a decade of the publication of the Kiepert2 and 
Gordan? algorithms for solution of the quintic equation. This theory extends 
ideas found to be useful for equations of degrees 5 and lower to the sextic 
equation. | 

A central feature of the theories of equations one to five is the role 
played played by certain groups of linear transformations of a single variable 
corresponding to the Galois groups of the equations. The quartic equation is 
the lowest degree equation in which the group of linear transformations plays a 
role, which, however, is not apparent in the method for solution of the quartic 


equation discussed in Section 5.2. The four roots of the quartic equation have 
six anharmonic ratios, namely 1, e 1~A, os , fad , and 
1~A A A-1 
discussed in Section 4.1 (see particularly equations 4.1-13 to 4.1-19). These 
six anharmonic ratios are roots of a sextic equation, which is not the general 
sextic equation but is characterized by the fact that every root is a rational linear 
fraction of every other root. This sextic equation reduces to the resolvent cubic 
equation (5.26) found in the solution of the quartic equation discussed in 
Section 5.2. | 
The corresponding group of linear transformations z — z' for the 


general quintic equation corresponds to the 60 operations of the alternating 


as 


1Cole, F. N., A Contribution to the Theory of the General Equation of 
the Sixth Degree, Amer. J. Math.,8, 265-286 (1886). 

2L. Kiepert, Auflésung der Gleichungen fiinften Grades, J. fiir Math. 
87, 114-133 (1878). 

3P, Gordan, Uber die Auflésung der Gleichungen vom fiinften Grade, 
Math. Ann., 13, 375-404 (1878). 
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group As, which can be represented by the following equation where z is an 
integer congruent to 0, 1, 2, 3, 4, or ce mod 5: 


az+ 8 


8.1-1 
yztd : 


N 
Ul 


By proper choice of a, B, y, and 5, z' is an integer. It turns out that there are 
60 sets of values for a, 8, y, and 5 that satisfy these conditions. Selection of 
any of these sets and putting z successively congruent to 0, 1,..., 4, and e mod 
5 leads to z' values congruent to 0,...,°0 but in a different order from the 
original. Thus a rational function of the roots of the quintic equation can be 
found which, when these roots undergo any even permutation, is linearly 
transformed by the formula 


oz oe z 
eer w (8.1-2) 


The function @ plays an analogous role in the theory of the quintic equation to 
the anharmonic ratio A in the theory of the quartic equation. The function o 
satisfies an equation of degree 60, which is related to the icosahedral 
polynomials (Section 2.5), which are involved in the solution of the general 
quintic equation (Sections 6.3 and 7.2). There is the important difference that 
the equation in A is simpler than the general quartic equation, i.e., is reducible 
to a cubic and quadratic equation, whereas the equation in 9 still has the same 
As Galois group of the original quintic equation. 

Extension of such ideas to the general sextic equation requires finding a 
group of linear transformations isomorphic with the full symmetric group 56 
with 6! = 720 operations or the corresponding alternating group Ag with 61/2 = 
360 operations. Klein4 has shown that there is no such finite group of linear 
transformations of a single variable. The smallest number of variables to 
represent the symmetric S¢ group through linear substitutions is three, which 
becomes four if the linear transformation is written in homogeneous form. Of 


4F Klein, Vorlesungen iiber das Ikosacder, Teubner, Leipzig, 1884. 
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these four homogeneous variables, the ratio of three to the fourth will then be 
transformed by a nonhomogeneous linear transformation. This group of 
transformations is known under the geometrical form in which this theory has 
been treated in connection with a surface of fourth order and class known as 
Kummer’s surface.5 

The inability to represent the Sg Galois group of the general sextic 
equation as a linear transformation of a single variable is a new complication 
that arises when the degree of an algebraic equation increases from 5 to6. A 
consequence of this is that the elliptic functions derived from elliptic integrals of 
the type 


fx) = (—& 


J Px) 


in which #(x) is a polynomial of degree 3 or 4 (Chapter 4) are not sufficient for 


(8.1-3) 


the solution of general equations of degrees 6 or higher. Hyperelliptic integrals 
of the type (8.1—3) in which f(x) is a polynomial of degree 5 or 6 are necessary 
for solution of the general sextic equation. These integrals are related to double 
theta functions (Section 4.4). 

Bolza® and Maschke? have published back-to-back papers in Mathemat- 
ische Annalen that provide a basis for solving the general sextic equation using 
double theta functions. Solution of the general sextic equation by this method is 
also discussed by Brioschi® along with the covariants and invariants of the 
general sextic equation. 

The general sextic equation 


x8 + ayx5 + anx4 + ayx3 + agx2 + asx +6 =0 (8.1-4) 


SR. W. H. T. Hudson, Kummer’s Quartic Surface, Cambridge, 1905. 

6Q. Bolza, Darstellung der Rationalen ganzen Invarianten der 
Bindrform sechsten Grades durch die Nullwerthe der zugehérigen @- 
Functionen, Math. Ann., 30, 478-495 (1887). 

7H. Maschke, Uber die quaternare, endliche, lineare 
Substitutionsgruppe der Borschardt’schen Moduln, Math. Ann., 30, 
496-515 (1887). 

8F. Brioschi, Sur I’Equation du Sixigme Degré, Acta Math., 12, 83-101 
(1888). 


REE EE ae OR TP NS EE he NE AS Tt eR Re ad he oT alt Ae poareet ee eee CSTE. Letina a are eie e wee re oe ae eee ee 
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can be converted to the special Maschke sextic equation 


D(y) = 6 — 6F gy4 + 4F y2y3 + 9Fg2y2 — 12F a0y + 4F 24 

= (9y3 — 3Fgy + 2F 12)? + 12(FgF 12 - F20)y - 4(F 12? — F2a) 

=0 (8.1-5) 
using a Tschirnhausen transformation requiring solution of a quartic equation so 
that the necessary relationship can be found between the coefficients of y4 and 
y2. The coefficients Fg, F12, F29, and F24 determine the invariants of a binary 
sextic $(x1,x2). The periods 141, t12, and t22 from the corresponding 
hyperelliptic integral 


a 
d(x 1/x2) 
f VS(x1/x2,1) 


u(a) = (8. 1-6) 


are substituted into the following four even double theta functions of zero 
argument (Section 4.4) to give the Borchardt moduli 


21 = (7) (01214 1,212,222) (8.1-7a) 
22 = 8( 99) (01241 1,2%12,2t22) (8.1-7b) 
23 = O( Gp )(Ol2t11,2t12,220) (8.1-7c) 
24 = 0( $3 (01244 1,2012,2t22) (8.1~7d) 


These Borchardt moduli provide the equation of a Kummer surface? 


> + Aw + By3 + Cw4 + 2Dy =0 (8.1~8) 
where | 

b = 214 + 294 + 234 + 244 (8.1—9a) 

Wo = 212292 + 232242 (8.1—9b) 

W3 + 212232 + 292242 —— (8.1-9e) 

W4 = 212242 + 292232 (8.1—9d) 

X = 21222324 (8.1—9e) 


The roots of the Maschke sextic (8.1—5) are then 
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yi = > + 6(-W2 — W3 — Wa) (8.1-10a) 
y2 = > + 6C-y2 + 3 + Wa) (8.1—10b) 
y3 = > + 6(W2 — Y3 + Wa) (8.1~10c) 
¥4 = + OC 2 + 3 - Wa) (8.1—10d) 
y5 = 2h — 24x (8.1-10e) 
¥6 = 2 + 24x (8.1-10f) 


Reversing the Tschirnhausen transformation converting the general sextic 
equation (8.1—4) to the Maschke sextic equation (8.1—5) can then give the roots 
of the original general sextic equation (8.1—4). 


8.2. The Septic Equation 


Table 2—3 lists seven possible transitive permutation groups of degree 7. 
All of these groups are possible Galois groups for irreducible septic equations 
(equations of degree 7). The most interesting group in this list is the group 
designated as L(3,2) of order 168. This is the smallest simple group which is 
not an alternating group, A, (# 25). Thus septic equations can be classified 
into the following three general types: 
(1) — Special septic equations soluble by radicals—namely equations with the 
cyclic, dihedral, and metacyclic groups of orders 7 (C7), 14 (D7), 21, and 42 
(M7). The groups of orders 7, 14, and 21, are all subgroups of the metacyclic 
group. 
(2) Septic equations with the simple Galois group L(3,2). Such equations 
require elliptic but not hyperelliptic functions for their solution and are 
discussed in this section. The L(3,2) group can also be a permutation group of 
degree 8 and is associated with the modular equation of degree 8 (see Section 
7.1). The theory for solving septic equations with the L(3,2) Galois group is 
closely related to the theory for solving the general quintic equation (Chapters 6 
and 7) although necessarily significantly more complicated. 
(3) General septic equations with the alternating or symmetric Galois 
groups A7 or $7. Such equations, like the general sextic equation (Section 
8.1), require hyperelliptic functions and associated theta functions of genus 3 
(Section 4.4) for their solution. General septic equations do not appear to have 
been studied specifically by the nineteenth century mathematicians studying the 


| 
| 


144 Beyond the Quartic Equation 


solution of algebraic equations, apparently because solution of the general sextic 
equation was already at the limits of feasibility in that noncomputer era. 

The alternating group As of order 60 used for solution of the general 
quintic equation can be modelled geometrically by the proper rotations of a 
regular icosahedron, i.e., the symmetry point group J (Sections 2.2 and 2.3). 
The ability of As =/ to function as an even permutation group on five objects is 
then represented by the permutations of the five suboctahedra generated from 
the midpoints of the 30 edges of the underlying icosahedron (Figure 2-5).? 
Figure 8-1 depicts analogous geometrical models for the L(3,2) group of order 
168 which reflect its function as a permutation group of degree 7 for the special 
septic equation and as a permutation group of degree 8 for the modular equation 
of degree 8 as follows: 

Degree 7: An equilateral triangle with its three altitudes and an inscribed 
circle forms a 7-point 7-line geometry presented in D3 symmetry. The 
permutations of the 7 vertex labels which preserve the 7 collineations (152, 
263, 173, 146, 247, 345, 567) form the L(3,2) group. Note that in this 
presentation the inscribed circle is treated on an equal basis with the six straight 
lines forming the three edges and the three altitudes of the triangle. 

Degree 8: The permutations of the vertex labels of a cuboid (rectangular box) 
of D2 point group symmetry which give non-superimposable cuboids form the 
L(3,2) group. 


/ a 
-—_—/| 
m3 
/ / 
3 6 
D3 7-point 7-line Dz» cuboid 
geometry 8 points 


Figure 8-1: Geometrical models of the L(3,2) group as a permutation group on 7 points 
and on 8 points. 


9F. J, Budden, The Fascination of Groups, Cambridge, 1972, pp. 272- 
277, p. 411, and pp. 495-496. 
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The L(3,2) group has an associated series of polynomials connected by 
their transvectants analogous to the polyhedral polynomials discussed in 
Sections 2.5 and 2.6 (e.g., equations 2.5~9 defining f, T, and H for the 
icosahedron). However, in the case of the L(3,2) group these polynomials are 
ternary forms with three homogeneous variables rather than the binary forms 
found in the polyhedral polynomials discussed in Section 2.5. 

The polynomials associated with the L(3,2) group were 
developed!9.11,12 in connection with the seventh order transformation of 
elliptic functions relating to the modular equation of degree 8. The homo- 
geneous ternary quartic 


f =23p + p3v + v30 (8.2-1). 


defines a fourth order curve by the relation f = 0. The Hessian of equation 
8.2—1 leads to a ternary sextic V, i.e., 


ef of oF 
aa2 dAdp aAdv 


2f 2 ZF 
— 34 sah oO shy = 5 A2p2y2-(aSv+v5p4+ pA) (8.22) 


af of of 
avdA avon av2 


Related methods can be used to define C and K of degrees 14 and 21 as 
follows: 


10F, Klein, Uber die Transformation siebenter Ordnung der 
elliptischen Funktionen, Math. Ann., 14, 428-471 (1879); Gesammelte 
Mathematische Abhandlungen, Springer Verlag, Berlin, 1923, Vol. 3, 
pp. 90-136. 

1lp. Gordan, Uber die typische Darstellung der ternaren 
biquadratischen Form f = x13x2 + x23x3 + x39x1, Math. Ann.,14, 359- 
378 (1880). 

12H, Weber, Lehrbuch der Algebra, Vieweg, Braunschweig, 1898, 
Volume 2, §§ 131-140. 


146 Beyond the Quartic Equation 


ef #f Pf wW 
QA2 OAdE OAV OA 


Pf Pf SF ow 


C 1 | HOA op2 dapdv op 
91 ate ate af av 
OvOA dvot ave av 

vw a wW 4 

Fy ar St 


= £14 — 34 ApvdAOy — 250 AuvEA8 3 + 375 A2p2v2D1 512 
~126 A3p3V3DA3p2 + 18 SAT 7 (8.2~3) 


of a aC 
dA dk OA 


-v-t| af Ww ac 
Naa on On op 


of OV aC 
dv av ov 


= YA21 —7 ApvdA! 7p + 217 ApvdAlSy3 + 1638 Apvd10u8 
— 308 A2p2v2SA 132 — 6279 A2p2v2d A916 + 637 A3p3v3F1 93 
+ 4018 A3p3v35110y2 10010 A4u4v4 FAS 4 + 7007 ASpSvSEASv 
— 57147 + pl4y7 + vl4,7) — 2897014 v7 + 1407 + vl47) + 3432 ATT V7 
(8.2—4) 
The summation sign, », in equations 8.2—3 and 8.24 means a sum of the three 
terms obtained from the first term by cyclic permutations of A, UL, and v; these 
equations are taken directly from Gordan’s paper.!! The homogeneous ternary 
polynomials V, C, and K satisfy a degree 42 identity 
1728 (-V)? ~C3 + K2=0 (8.2—5) 
closely related to the degree 60 icosahedral identity (compare equation 2.5—12c) 
1728 f5 - H3 -12=0 | (8.2-6). 


lp kA Ro ge ST 
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Radford!3 has discussed the solution of septic equations with the 
L(Q3,2) Galois group. Such equations are expressed by both Klein!9 and 
Radford!3 in the general form 


54-7 
C1 + To(-1 ¢ V-T)Vc4 — ut +5 }¥2c -~C=0 (8.2~7) 


in which the parameters V and C are defined by equations 8.2—2 and 8.2-3, 
respectively. The complex coefficients in equation 8.2~7 can be eliminated by a 
suitable change in variable although this does not appear to have been worked 
out in detail, at least in the 19th century mathematical literature. Equation 8.2—7 
resembles one form of the Brioschi quintic equation, namely 

 — 10/8 + 45f1-T=0 (8.2-8) 
in which V and C in equation 8.2—7 play analogous roles to f and T, respec- 
tively, in equation 8.2~8. The octic (degree 8) equation corresponding to 
equation 8.2—7, which also has an L(3,2) Galois group, but now acting on 
eight roots corresponding to the cuboid vertices in Figure 8—1, has the form 

58 — 14V56 + 63V284 — 70V382 — Ki —-7V4 =0 (8.2-9) 
This is an octic analogue of the following Jacobi sextic (Compare equation 
6.4~1): 

56 — 10fs3 + Hs + 5f2 =0 (8.2-10) 
Radford!3 has expressed the roots of the octic equation (8.2~9) in product or 
sums related to theta series (Section 4.3) analogous to the formulas given in 
Section 6.6 for the Jacobi sextic (8.210) derived from the general quintic 
equation. 


8.3. The General Algebraic Equation of Any Degree 


Jordan!4 has shown that any algebraic equation can be solved using 
modular functions. The basis of such solutions involves Thomae’s 


13—, M. Radford, On the Solution of Certain Equations of the Seventh 
Degree, Quart. J. Math., 30, 263-306 (1898). 


14C. Jordan, Traité des Substitutions et des Equations Algébriques, 
Gauthiers-Villars, Paris, 1870, 
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formulas'5.16,17, The following theorem discussed by Umemura!8 provides a 
basis for expressing the roots of any algebraic equation by higher genus theta 
functions (Section 4.4): . 


Theorem 8.3-1: 
Let f(x) = agx” + ax"! +... ay =0 with ap #0 (8.3-1) 
be an algebraic equation irreducible over a certain subfield of the complex 


numbers. Then a root of this equation, xx, can be expressed by the following 
equation involving theta functions of zero argument: 


4 4 4 
af -[0(32:58)e]]o(4t858)e| +{0(¢.8)a Cem 


o(23:3)]] (2329) 0] 
Baloo | aca 
10(4°.58)e0]]o(61°--2)e 


In equation 8.3—2 Q is the period matrix derived from one of the following 


(8.3-2) 


hyperelliptic integrals: 
a 


_ (__ ie 
u(a) = WeclfG) for n odd (i.e., odd degree of f(x) | (8.3—3a) 


or 


15J, Thomae, Beitrag zur Bestimmung von @(0,0,...,0) durch die 
Klassenmoduln algebraischer Functionen, J. fiir Math,71, 201-222 
(1869). 

16f, Lindemann, Uber die Auflésung algebraischer Gleichungen 
durch transcendente Functionen I, II, Géttingen Nach., 245-248 
(1894); 292-298 (1892), 

17), Mumford, Tata Lectures on Theta I, Birkhauser, Boston, 1983. 
18H, Umemura, Resolution of Algebraic Equations by Theta Constants, 
in Tata Lectures on Theta Il, D. Mumford, ed., pp. 3.261-3.272, 
Birkhauser, Boston, 1984. 
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a 
= dx : } oe 
u(a) = See DI for n even (i.e., even degree of f(x)) (8.3—3b) 


This theorem applies to algebraic equations in any form without the need to use 
Tschirnhausen or other transformations to bring the original algebraic equation 
into a special form such as the Brioschi or Bring-Jerrard normal forms of the 
quintic equation. However, application of this theorem in practice is very 
difficult because of the complexity of the relevant hyperelliptic integrals and 
higher genus theta functions. 


